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Abstract 

The "principle of the fermionic projector" provides a new mathematical frame- 
work for the formulation of physical theories and is a promising approach for 
physics beyond the standard model. The book begins with a brief review of 
relativity, relativistic quantum mechanics and classical gauge theories, with the 
emphasis on the basic physical concepts and the mathematical foundations. The 
external field problem and Klein's paradox are discussed and then resolved by 
introducing the so-called fermionic projector, a global object in space-time which 
generalizes the notion of the Dirac sea. The mathematical core of the book is 
to give a precise definition of the fermionic projector and to employ methods of 
hyperbolic differential equations for its detailed analysis. The fermionic projector 
makes it possible to formulate a new type of variational principles in space-time. 
The mathematical tools for the analysis of the corresponding Euler-Lagrange 
equations are developed. A particular variational principle is proposed which 
gives rise to an effective interaction showing many similarities to the interactions 
of the standard model. 

The main chapters of the book are easily accessible for beginning graduate 
students in mathematics or physics. Several appendices provide supplementary 
material which will be useful to the experienced researcher. 
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Preface 



The basic ideas behind the "principle of the fcrmionic projector'" go back to the 
years 1990-91 when I was a physics student in Heidelberg. At that time, I was excited 
about relativity and quantum mechanics, in particular about classical Dirac theory, 
but I felt uncomfortable with quantum field theory. The dissatisfaction with second 
quantized fields, which was in part simply a beginner's natural skepticism towards an 
unfamiliar physical concept, was my motivation for thinking about possible alterna- 
tives. Today I clearly understand quantum field theory much better, and many of my 
early difficulties have disappeared. Nevertheless, some of my objections remain, and 
the idea of formulating physics in a unified way based on Dirac 's original concept of a 
"sea of interacting particles" seems so natural and promising to me that I have pursued 
this idea ever since. It took quite long to get from the first ideas to a consistent theory, 
mainly because mathematical methods had to be developed in order to understand the 
"collective behavior" of the particles of the Dirac sea. 

This book gives me the opportunity to present the main ideas and methods in a 
somewhat broader context, with the intention of making this area of mathematical 
physics accessible to both theoretical physicists and applied mathematicians. The 
emphasis of the main chapters is on the conceptual part, whereas the more technical 
aspects are worked out in the appendices. 

I am grateful to Claus Gerhardt, Joel Smoller, Shing-Tung Yau and Eberhard 
Zeidler for their encouragement and support. I would like to thank Stefan Hoch, Niky 
Kamran, Johann Kronthaler, Watzold Plaum and Joel Smoller for helpful comments, 
and Eva Riitz for the typesetting. Finally, I am grateful to the Max Planck Institute 
for Mathematics in the Sciences, Leipzig, and the Morningside Center, Beijing, for 
their hospitality while I was working on the manuscript. 

Felix Finster, Regensburg, November 2004 
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Preface to the Online Edition 



In the few years since the book appeared, I was frequently asked if the introduetory 
chapters were also available online. Also, I heard complaints that the preprints on the 
arXiv on the "principle of the fermionic projector" were preliminary versions which 
were not quite compatible with the book and with subsequent papers. In order to 
improve the situation, I decided to replace my original preprints hep-th/0001048, hep- 
th/0202059 and hep-th/0210121 by the corresponding chapters of the book. 

I took the opportunity to correct some typos. I also added a few footnotes begin- 
ning with "Online version:" which point out to later developments. Apart from these 
changes, the present online version coincides precisely with the book in the AMS/IP 
series. In particular, all equation numbers are the same. 

Felix Finster, Regensburg, October 2009 
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CHAPTER 



The Principle of the Fermionic Projector — A New 
Mathematical Model of Space-Time 

The mathematical model of space-time has evolved in history. In Newtonian me- 
chanics, space is described by a Euclidean vector space. In special relativity, space and 
time were combined to Minkowski space, a vector space endowed with a scalar product 

of signature (H ). In general relativity, the vector space structure of space-time 

was given up on the large scale and was replaced by that of a Lorentzian manifold. The 
first hint that the notions of space and time should be modified also on the microscopic 
scale was obtained by Planck, who observed that the gravitational constant, Planck's 
constant and the speed of light give rise to a quantity of the dimension of length. 



and he conjectured that for distances as tiny as this so-called Planck length, the 
conventional laws of physics should no longer hold, and yet unknown physical effects 
might become significant. Later, this picture was confirmed by quantum field theory. 
Namely, due to the ultraviolet divergences, pcrturbativc QFT is well-defined only 
after regularization, and the regularization is then removed using the renormalization 
procedure. While renormalization ensures that the observable quantities do not depend 
on the regularization, the theoretical justification for the renormalization program lies 
in the assumption that the continuum theory should be valid only down to some 
microscopic length scale, and it seems natural to associate this length scale to the 
Planck length. 

Today most physicists agree that in order to make progress in fundamental physics, 
one should go beyond the continuum field theory and try to get a better understanding 
of the microscopic structure of space-time. However, giving up the usual space-time 
continuum leads to serious problems, and this is one reason why there is no consensus 
on what the correct mathematical model for "Planck scale physics" should be. Let 
us illustrate the difficulties by briefly discussing a few of the many approaches. The 
simplest and maybe most natural approach is to assume that on the Planck scale 
space-time is no longer a continuum but becomes in some way "discrete." This idea 
is for example used in lattice gauge theories, where space-time is modeled by a four- 
dimensional lattice (see Figure 0.1(a)). Using the specific structures of a lattice like 
the nearest-neighbor relation and the lattice spacing d, one can set up a quantum field 
theory which is ultraviolet finite [Ro]. Lattice gauge theories are very useful for nu- 
merical simulations [K]. However, they are not fully satisfying from a conceptual point 
of view because a space-time lattice is not consistent with the equivalence principle 
of general relativity. Namely, if one considers the lattice in the reference frame of an 
accelerated observer (denoted in in Figure 0.1(b) by (t',x')), the lattice points are no 
longer in a regular configuration. Thus the structure of a lattice is not invariant under 
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Figure 0.1. A lattice regularization for two different observers. 



general coordinate transformations and hence is not compatible with the equivalence 
principle. 

An alternative approach is to hold on to a space-time continuum, but to work with 
objects which are spread out in space-time, thereby avoiding the ultraviolet problems 
which occur for very small distances. The most prominent example in this direction is 
string theory, where physics is on a fundamental level described by so-called strings, 
which are extended in space-time and are therefore ultraviolet finite. The basic problem 
with such theories is that they are formulated using the structures of an underlying 
continuum space-time (like the vector space structure, the topology or even a metric), 
although all observable quantities (like the Lorentz metric, particles, fields, etc.) are to 
be derived from the non-localized objects, without referring to the underlying space- 
time continuum. Therefore, the structures of the underlying "continuum background" 
may seem artificial, and serious conceptual problems arise when these background 
structures are not compatible with basic physical principles (for example, a background 
vector space is not compatible with the equivalence principle). For short, one says 
that the theory is not background-free (for a more detailed discussion see [Ba] and the 
references therein). 

Thus one difficulty in finding a promising model for Planck scale physics is that 
it should be background-free and should respect the basic physical principles (like the 
equivalence principle, the local gauge principle, etc.). There are approaches which 
actually meet these requirements. One is Connes' noncommutative geometry. As 
pointed out by Grothendieck, there is a one-to-one correspondence between the points 
of a manifold and the prime ideals of the (commutative) algebra of functions on this 
manifold. Thus the geometry of a manifold can be recovered from an underlying 
algebraic structure, and this makes it possible to extend the notions of space and time 
by considering more general, noncommutative algebras (see [Co, CC] for details and 
physical applications). The other approach is quantum gravity as pursued by Ashtekar 
and his school [ARS, Th]. The hope is that the ultraviolet divergences of QFT should 
disappear as soon as gravity, quantized in a non-perturbative way, is included. 

Ultimately, a model for space-time on the Planck scale must be verified or falsified 
by physical experiments. Unfortunately, experiments on the Planck scale would require 
such enormously high energies that they are at present out of reach. Indirect experi- 
ments seem possible in principle [ARA] but have so far not been carried out. In my 
opinion, one should not hope for important new experimental input in the near future, 
but one should try to make due with the experimental data which is now available. 
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Indeed, this situation is not as hopeless as it might appear at first sight. Namely, in 
present physical models like the standard model, a lot of information from experiments 
is built in empirically, like the masses of the elementary particles, the gauge groups, 
the coupling constants, etc. Therefore, one can get a connection to experiments simply 
by trying to reproduce this well known empirical data. If successful, this procedure 
could give strong physical evidence for a model. For example, even if based on ad- 
hoc assumptions on the microscopic structure of space-time (which cannot be verified 
directly), a model would be very convincing and physically interesting if it gave the 
correct value for the fine structure constant and explained e.g. why the strong gauge 
group is 5*^7(3) or why neutrinos do not couple to the electromagnetic field. Thus the 
goal of a mathematical model for space-time on the Planck scale is to give a more 
fundamental explanation for the structures and empirical parameters in the standard 
model. To our opinion, only such concrete results can justify the model. Clearly, it is 
far from obvious how a promising model should look, or even in which mathematical 
framework it should be formulated. But at least for a mathematician, this makes the 
problem only more interesting, and it seems a challenging program to search for such 
models and to develop the mathematical methods needed for their analysis. 

Our point of view that the mathematical model needs justification by known exper- 
imental data is not just a requirement which the model should fulfill at the very end, 
but it also gives a few hints on how one should proceed in order to find a promising 
model. First of all, one can expect concrete results only if one makes specific assump- 
tions. Therefore, generalizing the notion of a Lorentzian manifold does not seem to 
be sufficient, but one should make a concrete ansatz for the microscopic structure of 
space-time (as it is done e.g. in string theory and lattice gauge theories). Furthermore, 
in order to make it easier to get a connection to well-established theories like classi- 
cal field theory and quantum mechanics, it seems a good idea to take these theories 
as the starting point and to try to work as closely as possible with the objects used 
in these theories. Namely, if one drops important structures of the classical theories 
and/or introduces too many new structures ad hoc, it might become very difficult if 
not impossible to obtain a relation to observable data. 

In our model of space-time we have tried to follow the above considerations. Our 
starting point is relativistic quantum mechanics and classical field theory. We assume 
that space-time is discrete on the Planck scale. But our notion of "discrete space-time" 
is much more general than a four-dimensional lattice; in particular, we do not assume 
any discrete symmetries in space-time, we keep the local gauge freedom, and we also 
extend the diffeomorphism invariance of a manifold in such a way that the equivalence 
principle is respected in discrete space-time. Furthermore, our model is background- 
free. In contrast to string theory, we do not introdTice any new objects, but hold on to 
the structures already present in classical Dirac theory. We build in our physical ideas 
simply by prescribing which of these structures we consider as being fundamental, and 
then carry over these structures to discrete space-time. In the resulting mathematical 
framework, it is impossible to formulate the conventional physical equations, and thus 
we propose instead new equations of different type, referred to as the equations of 
discrete space-time. In a certain limiting case, the so-called continuum limit, we get 
a connection to the conventional formulation of physics in a space-time continuum. 
We point out that, in contrast to the Ashtekar program, we do not work with second 
quantized fields. But our concept is that the equations of discrete space-time should 
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also account for the physical effects of quantized fields if one goes beyond the continuum 
limit. 

More specifically, wc describe the physical system by the fermionic projector P{x, y), 
which can be regarded as the projector on all occupied fermionic states of the system, 
including the states of the Dirac sea. After carrying over the fermionic projector to 
discrete space-time, we can set up variational principles like our "model variational 
principle" 

^ C[P{x,y) P{y,x)] min, 

x,yeM 

where the "Lagrangian" jC is given by 

£[A] = \A'\-,^\A\\ 

with fi a Lagrangian multiplier. Here 1^41 is the so-called spectral weight defined as 
the sum of the absolute values of the eigenvalues of the matrix A (or, in case that A is 
not diagonalizable, of the zeros of its characteristic polynomial). We study the above 
variational principle for a fermionic projector which in the vacuum is the direct sum 
of seven identical massive sectors and one massless left-handed sector, each of which 
is composed of three Dirac seas. Analyzing the continuum limit for an interaction via 
general chiral and (pseudo) scalar potentials, we find that the sectors spontaneously 
form pairs, which are referred to as blocks. The resulting so-called effective interaction 
can be described by chiral potentials corresponding to the effective gauge group 

SU{2) X SU{3) X U{lf . 

This model has striking similarity to the standard model if the block containing the 
left-handed sector is identified with the leptons and the three other blocks with the 
quarks. Namely, the effective gauge fields have the following properties. 

• The SU{3) corresponds to an unbroken gauge symmetry. The SU (3) gauge 
fields couple to the quarks exactly as the strong gauge fields in the standard 
model. 

• The SU{2) potentials are left-handed and couple to the leptons and quarks 

exactly as the weak gauge potentials in the standard model. Similar to the 
CKM mixing in the standard model, the off-diagonal components of these 
potentials must involve a non-trivial mixing of the generations. The SU{2) 
gauge symmetry is spontaneously broken. 

• The U{1) of electrodynamics can be identified with an Abelian subgroup of 
the effective gauge group. 

The effective gauge group is larger than the gauge group of the standard model, but 
this is not inconsistent because a more detailed analysis of our variational principle 
should give further constraints for the Abelian gauge potentials. Moreover, there 
are the following differences to the standard model, which we derive mathematically 
without working out their physical implications. 

• The SU{2) gauge field tensor F must be simple in the sense that F = As for 
a real 2-form A and an su(2)-valued function s. 

• In the lepton block, the off-diagonal SU (2) gauge potentials are associated 
with a new type of potential, called nil potential, which couples to the right- 
handed component. 

These results give a strong indication that the principle of the fermionic projector is of 
physical significance. Nevertheless, the goal of this book is not to work out our model 
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variational principle in all details. Our intention is to develop the general concepts 
and methods from the basics, making them easily accessible to the reader who might 
be interested in applying them to other equations of discrete space-time or to related 
problems. 

These notes are organized as follows. In order to make the presentation self- 
contained, Chapter 1 gives a brief account of the mathematical and physical prelim- 
inaries. Chapter 2 introduces the fcrmionic projector in the continuum and provides 
the mathematical methods needed for its detailed analysis. In Chapter 3 we go be- 
yond the continuum description and introduce our mathematical model for space-time 
on the Planck scale. In Chapter 4 we develop a mathematical formalism suitable for 
the analysis of the continuum limit. In Chapter 5 we present and discuss different 
equations of discrete space-time in the vacuum, and we choose the most promising 
equations as our "model equations". In the last Chapters 6-8 we analyze interacting 
systems in the continuum limit. The appendices contain additional material and will 
be referred to from the main chapters. 



CHAPTER 1 



Preliminaries 

1.1. Relativity 

In this section we briefly outline the mathematical framework of special and general 
relativity (for a more detailed introduction see [Wo] and [Wa]). We always work 
in normal units where h = c = 1. In special relativity, space-time is described by 
Minkowski space (M, (.,.)), a real 4-dimensional vector space endowed with an inner 

product of signature (H ). Thus, choosing a pseudo-orthonormal basis (ei)j=o,...,3 

and representing the vectors of M in this basis, ^ = Yl^=oC^ij the inner product takes 
the form 

3 

= E 9jkev', (i-i-i) 

j,k=0 

where gij, the Minkowski metric, is the diagonal matrix g = diag (1, — 1, — 1, — 1). In 
what follows we usually omit the sums using Einstein's summation convention (i.e. we 
sum over all indices which appear twice, once as an upper and once as a lower index). 
Also, we sometimes abbreviate the Minkowski scalar product by writing ^ij := rj) 
and := {^,^). A pseudo-orthonormal basis (e,)i=o,...,3 is in physics called a reference 
frame, because the corresponding coordinate system (x*) of Minkowski space gives the 
time and space coordinates for an observer in a system of inertia. We also refer to 
t := as time and denote the spatial coordinates by 

The sign of the Minkowski metric encodes the causal structure of space-time. 
Namely, a vector ^ G M is said to be 

timelike if ^) > "| 

spacelike if (^,^) < > 
null if = 0- J 

Likewise, a vector is called non-spacelike if it is timelike or null. The null vectors form 
the double cone L = e M \ {^,0 = 0}, referred to as the light cone. Physically, 
the light cone is formed of all rays through the origin of M which propagate with the 
speed of light. Similarly, the timelike vectors correspond to velocities slower than light 
speed; they form the interior light cone I = ^ M \ {^,^) > 0}. Finally, we introduce 
the closed light cone J = G M | ^) > 0}. The space-time trajectory of a moving 
object describes a curve g(r) in Minkowski space (with r an arbitrary parameter). We 
say that the space-time curve q is timelike if the tangent vector to q is everywhere 
timelike. Spacelike, null, and non-spacelike curves are defined analogously. The usual 
statement of causality that no information can travel faster than with the speed of 
light can then be expressed as follows, 

causality: information can be transmitted only along 
non-spacelike curves. 



7 



8 



1. PRELIMINARIES 



The set of all points which can be joined with a given space-time point a; by a non- 
spacelike curve is precisely the closed light cone centered at x, denoted by Jx '■= J — x. 
It is the union of the two single cones 

= {yGM|(y-rr)2>0, (yO-x°)>0} 
= {yeM\{y- xf > 0, (y° - a;°) < 0} , 

which have the interpretation as the points in the causal future and past of x, respec- 
tively. Thus we refer to and as the closed future and past light cones centered 
at X, respectively. Similarly, we also introduce the sets , and L^, L^. 

A linear transformation of Minkowski space which leaves the form of the Minkowski 
metric (1.1.1) invariant is called a Lorentz transformation. The Lorentz transforma- 
tions form a group, the Lorentz group. The Lorentz transformations which preserve 
both the time direction and the space orientation form a subgroup of the Lorentz 
group, the orthochronous proper Lorentz group. 

The physical equations should be Lorentz invariant, meaning that they must be 
formulated in Minkowski space, independent of the reference frame. A convenient 
way of making Lorentz invariance manifest is to bring the equations in tensorial form 
(see [L] for a good introduction). Writing out the tensor indices, we get upper "con- 
travariant" and lower "covariant" indices, which can be transformed into each other by 
contracting with the metric, e.g. = gjkC^ and = g^^^i with g^^ = {gki)~^- In order 
to formulate electrodynamics in a manifestly Lorentz invariant form, one combines the 
electric potential and the vector potential to a 1-form A = Ajdx^ , the electromagnetic 
potential. The electric and magnetic fields are then components of the electromagnetic 
field tensor F defined by 

F = dA or, in components, Fj^ = djA^ — d^Aj . 

The Maxwell equations take the form 

dkF^' = Cj' , (1.1.2) 

where j is the so-called 4-current and the constant C = Aire involves the electromag- 
netic coupling constant (we use the sign convention where e > 0). For an observer in 
a reference frame, the time and spatial components j° and j of the 4-current have the 
interpretation as the electric charge density and the electric current density, respec- 
tively. Since we shall always work in the 4-dimensional setting, it is unambiguous to 
refer to j simply as the electromagnetic current. Now consider a point particle of mass 
m and unit charge e in a given (=external) electromagnetic field. Since by causality 
its velocity is always smaller than light speed, its trajectory is a timelike curve ^(r). 
Thus we can parametrize it by the arc length, i.e. 

(■u, u) = 1 with u(r) := -^-^'(t) . 

UT 

In a reference frame, the time and spatial components of the vector m-uij) are the 
energy and momentum of the particle at the space-time point ^(t). We refer to mu 
as the momentum of the particle. The parameter r has the interpretation as the 
proper time of an observer moving along q. The equation of motion is the tensor 
equation ni-^u^ = —eF^^Uk- Since we shall only consider particles of unit charge, it 
is convenient to remove the parameter e from the eqiiation of motion. To this end, we 
rescale the electromagnetic potential according to A e^^A. Then the equation of 
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motion simplifies to 

m-^u^ = -F'J'^Uk, (1.1.3) 
dr 

whereas the constant C in the Maxweh equations (1.1.2) becomes C = 47re^. 

The starting point for general relativity is the observation that a physical process 
involving gravity can be understood in different ways. Consider for example an ob- 
server at rest on earth looking at a freely falling person (e.g. a diver who just jumped 
from a diving board). The observer at rest argues that the earth's gravitational force, 
which he can feel himself, also acts on the freely falling person and accelerates him. 
The person at free fall, on the other hand, does not feel gravity. He can take the point 
of view that he himself is at rest, whereas the earth is accelerated towards him. He 
then concludes that there are no gravitational fields, and that the observer on earth 
merely feels the force of inertia corresponding to his acceleration. Einstein postulated 
that these two points of view should be equivalent descriptions of the physical process. 
More generally, it depends on the observer whether one has a gravitational force or an 
inertial force. In other words, 

equivalence principle: no physical experiment can distin- 
guish between gravitational and in- 
ertial forces. 

In mathematical language, observers correspond to coordinate systems, and so the 
equivalence principle states that the physical equations should be formulated in general 
(i.e. "curvilinear" ) coordinate systems, and should in all these coordinate systems have 
the same mathematical structure. This means that the physical equations should be 
invariant under diffeomorphisms, and thus space-time is to be modeled by a Lorentzian 
manifold (M, g). 

A Lorentzian manifold is "locally Minkowski space" in the sense that at every 
space-time point p G M, the corresponding tangent space TpM is a vector space en- 
dowed with a scalar product (., .)p of signature (-1 ). Therefore, we can distin- 
guish between spacelikc, timelike and null tangent vectors. Defining a non-spacclikc 
curve g(r) by the condition that its tangent vector ■u(r) G T^j-^^M be everywhere non- 
spacelike, our above definition of light cones and the notion of causality immediately 
carry over to a Lorentzian manifold. In a coordinate chart, the scalar product {■,-)p 
can be represented in the form (1.1.1) where gjk is the so-called metric tensor. In 
contrast to Minkowski space, the metric tensor is not a constant matrix but depends 
on the space-time point, gjk = gjkip)- Its ten components can be regarded as the 
relativistic analogue of Newton's gravitational potential. For every p G M there are 
coordinate systems in which the metric tensor coincides with the Minkowski metric up 
to second order, 

gjkip) = diag(l,-l,-l,-l) , djgkiip) = 0. (1-1-4) 

Such Gaussian normal coordinates correspond to the reference frame of a "freely falling 
observer" who feels no gravitational forces. However, it is in general impossible to 
arrange that also djkgim{p) = 0. This means that by going into a suitable reference 
frame, the gravitational field can be transformed away locally (=in one point), but 
not globally. With this in mind, a reference frame corresponding to Gaussian normal 
coordinates is also called a local inertial frame. 

The equation of motion (1.1.3) and the Maxwell equations (1.1.2) can easily be 
formulated on a Lorentzian manifold by the prescription that they should in a local 
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inertial frame have the same form as in Minkowski space; this is referred to as the 
strong equivalence principle. It amounts to replacing all partial derivatives by the 
corresponding covariant derivatives V of the Levi-Civita connection; we write symbol- 
ically 

d — ^ V. (1.1.5) 

We thus obtain the equations 

mVrU^ = -F^^Uk, VkF^'' = Aire^ f (1.1.6) 

with Fjk = {dA)jk = VjAk - VkAj. 

The gravitational field is described via the curvature of space-time. More precisely, 
the Riemannian curvature tensor is defined by the relations 

Ri^^u^ = VjVku' -VkVju\ (1.1.7) 

Contracting indices, one obtains the Ricci tensor Rjk = R^jik scalar curvature 

R = R-j. The relativistic generalization of Newton's gravitational law are the Einstein 
equations 

Rjk-^Rgjk = SiTKTjk, (1.1.8) 

where k is the gravitational constant. Here the energy-momentum tensor Tjk gives the 
distribution of matter and energy in space-time. 

It is very convenient that the physical equations can all be derived from a varia- 
tional principle. To this end, one considers the action (see e.g. [LL]) 

S = / + + |^^(-_L,F,,F^'.--i-i;)<i,., (i.i.i,, 

where u = c'{t) is the tangent vector of a timelike curve, and dfi := \/ — dct g d'^x 
is the integration measure on M. This action is not bounded below, but one can 
nevertheless look for stationary points and derive the corresponding Euler-Lagrange 
equations. Varying the space-time curve, the electromagnetic potential and the metric 
yield the equations of motion, the Maxwell equations and the Einstein equations, 
respectively. 

1.2. Relativistic Quantum Mechanics 

We now give an elementary introduction to relativistic quantum mechanics in Min- 
kowski space (for more details see [BDl, T]). According to the Heisenberg Uncertainty 
Principle, the position and momentum of a quantum mechanical particle cannot be 
determined simultaneously, making it impossible to describe the particle by a trajec- 
tory in space-time. Instead, one uses a wave function ^'(t,x), whose absolute square 
has the interpretation as the probability density that the particle is at position x. The 
simplest relativistic wave equation is the Klein-Gordon equation 

(-□-m^)* = 0, (1.2.1) 

where □ = djd^ is the wave operator. This equation describes a scalar particle (=par- 
ticle without spin) of mass m. If the particle has electric charge, one needs to suitably 
insert the electromagnetic potential A into the Klein-Gordon equation. More precisely, 
one finds empirically that the equation 

-{dk - iAk){d^ - iA^) ^ = (1.2.2) 
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describes a scalar particle of mass m and charge e in the presence of an electromagnetic 
field. 

In order to describe a particle with spin, it was Dirae's idea to work with a first 
order differential operator whose square is the wave operator. One introduces the 
Dirac matrices 7'' as 4 x 4-matrices which satisfy the anti- commutation relations 

2ff'''l = {7^ t'^} = 7V + 7V- (1-2.3) 

Then the square of the operator jWj is 

(7^5,)2 = 7V9A = ^Wn^jdjk = □. (1.2.4) 
For convenience, we shall always work in the Dirac representation 

/^(;°y (1.2.5) 



^ -1 / ' ' V -<? 

where a are the three Pauli matrices 



loy \ i r vo-i 

The Dirac equation in the vacuum reads 

i7''^-n?j^{x) = 0, (1.2.6) 

where ^'(x), the Dirac spinor, has four complex components. The leptons and quarks 
in the standard model are Dirac particles, and thus one can say that all matter is 
on the fundamental level described by the Dirac equation. Multiplying (1.2.6) by the 
operator (ij^dj + m) and using (1.2.4), one finds that each component of ^ satisfies 
the Klein- Gordon equation (1.2.1). In the presence of an electromagnetic field, the 
Dirac equation must be modified to 

i-f''{dk-iAk)^ = m^. (1.2.7) 

Multiplying by the operator (i^^{dj — iAj) + m) and using the anti-commutation 
relations, we obtain the equation 



* = 0. 



This differs from the Klein-Gordon equation (1.2.2) by the extra term ^Fj^j^^^, which 
describes the coupling of the spin to the electromagnetic field. We also denote the 
contraction with Dirac matrices by a slash, i.e. ^ = j^uj for u a vector of Minkowski 
space and ^ = ^yWj. 

The wave functions at every space-time point are endowed with an indefinite scalar 
product of signature (2, 2), which we call spin scalar product and denote by 

4 

I ^y{x) = 51 *°(X)* $"(X) , si = S2 = 1, S3 = S4 = -1 , (1-2.8) 

a=l 

where \I'* is the complex conjugate wave function (this scalar product is often written 
as with the so-called adjoint spinor = ^'*7°). By the adjoint A* of a matrix A 
we always mean the adjoint with respect to the spin scalar product as defined via the 
relations 

^yl*^ = -<;^ I A^y for all 
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In an obvious way, this definition of the adjoint gives rise to the notions "selfadjoint" 
"anti-selfadjoint" and ^^unitary." With these notions, the Dirac matrices are selfad- 
joint, meaning that 

^7'* I = I 7'$;^ for all 
To every solution * of the Dirac equation we can associate a time-like vector field J 

by 

J*^ = I 7'^ qiy , (1.2.9) 
which is called the Dirac current. The Dirac current is divergence-free, 

9^ J*^ = dk I 7^" = ~<dk^ I 7^" + I j'^dk 
= i{~<i(^^\^> <^ I i^*^) 

= i(-<(i^ + 4- "T-)^ I <* I (i^-F 4 - H*^) = 0) 

this is referred to as current conservation. 

So far Dirac spinors were introduced in a given reference frame. In order to verify 
that our definitions are coordinate independent, we consider two reference frames (x^) 
and (x') with the same orientation of time and space. Then the reference frames are 
related to each other by an orthochronous proper Lorentz transformation A, i.e. in 
components 

= A'- x^ , A' — = — , 

and A leaves the Minkowski metric invariant, 

A^- K dim = 9jk . (1.2.10) 

Under this change of space-time coordinates, the Dirac operator i'j^ldj.j — lAj) trans- 
forms to 

if (^-^ - iAi^ with f = A''jj^ . (1.2.11) 

This transformed Dirac operator does not coincide with the Dirac operator ij\d^i — 
iAi) as defined in the reference frame (x^) because the Dirac matrices have a different 
form. However, the next lemma shows that the two Dirac operators do coincide up to 
a suitable unitary transformation of the spinors. 

Lemma 1.2.1. For any orthochronous proper Lorentz transformation A there is a 
unitary matrix U{A) such that 

U{A) A^.7^' C/(A)-i = y . 

Proof. Since A is orthochronous and proper, we can write it in the form A = 
exp(A) , where A is a suitable generator of a rotation and/or a Lorentz boost. Then A{t) := 
exp(tA), t G M, is a family of Lorentz transformations, and differentiating (1.2.10) with 
respect to t at t = 0, we find that 

Aj- gik = -gjm Afe* . 

Using this identity together with the fact that the Dirac matrices are selfadjoint, it is 
straightforward to verify that the matrix 
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is anti-selfadjoint. As a consequence, the family of matrices 

U(t) := exp(tu) 

is unitary. We now consider for a fixed index / the family of matrices 

A{t) := U{t)k{t)\^^U{t)-\ 
Clearly, A(0) = 7'. Furthermore, differentiating with respect to t gives 

I A{t) = U K] [u r' u + A^t'} U-^ , 

and a short calculation using the commutation relations 

[ink,i^ = 2 (ji g''^ - dj j''^ 

shows that the curly brackets vanish. We conclude that A{1) = A{0), proving the 
lemma. ■ 

Applying this lemma to the Dirac operator in (1.2.11), one sees that the Dirac oper- 
ator is invariant under the joint transformation of the space-time coordinates and the 
spinors 

— > A{x'' , ^' — > [/(A) . (1.2.12) 

Moreover, since the matrix U{A) is unitary, the representation of the spin scalar prod- 
uct (1.2.8) is valid in any reference frame. We conclude that our definition of spinors 
is indeed coordinate invariant. 

Out of the Dirac matrices one can form the pseudoscalar matrix p by 

P = ^ ejklmlhWl"' (1-2.13) 

(this matrix in the physics literature is usually denoted by 7^). Here ejkim is the 
totally antisymmetric symbol (i.e. ejkim is equal to ±1 if (j, k, I, m) is an even and odd 
permutation of (0,1,2,3), respectively, and vanishes otherwise). A short calculation 
shows that the pseudoscalar matrix is anti-selfadjoint and = 1. As a consequence, 
the matrices 

XL = \{1-P). XR=\{-^+p) (1-2.14) 

satisfy the relations 

X\/R = Xl/R , PXL = -XL , PXR = XR , X*L = XR , XL + Xi? = 1 - 

They can be regarded as the spectral projectors of the matrix p and are called the 
chiral projectors . The projections xl^ and Xr^ are referred to as the left- and right- 
handed components of the spinor. A matrix is said to be even and odd if it commutes 
and anti-commutes with p, respectively. It is straightforward to verify that the Dirac 
matrices are odd, and therefore 

7^ Xl/r = Xr/l 7^ • 

Using this relation, one can rewrite the Dirac equation (1.2.7) as a system of equations 
for the left- and right-handed components of 

i-f^{dk-iAk)xL'^ = mxi?^, i-/'' {dk - iAk) xr"^ = mxL^ ■ 



14 



1. PRELIMINARIES 



If m = 0, these two equations decouple, and we get separate equations for the left- 
and right-handed components of This observation is the starting point of the 2- 
componcnt Wcyl spinor formahsm. Wc shall not use this formalism here, but will 
instead describe chiral massless particles (like neutrinos) by the left- or right-handed 
component of a Dirac spinor. 

For the probabilistic interpretation of the Dirac wave function, we need to dis- 
tinguish a direction of time and work in a particular reference frame. Then the zero 
component of the Dirac current J^{t, x) has for a given time t the interpretation as the 
probability density of the particle to be at position x (and is thus the relativistic ana- 
logue of the absolute square l^'p of the Schrodinger or Pauli wave functions). Clearly, 
for this probabilistic interpretation the wave function must be properly normalized. 
More precisely, physical states must satisfy the normalization condition 

/ ^*|7°*^(t,f)dx = 1 . (1.2.15) 
7ir3 

The integral in (1.2.15) is also called the probability integral. Using Gauss' (divergence) 
theorem and the current conservation, one sees that the normalization integral is time 
independent, 



/ ^y{t2,x) dx - I'j'^ •^y{ti,x) dx 

7ir3 Jir3 

= f^dtf dxdk^^\j''^y{t,x) = 0, (1.2.16) 



and thus it suffices to satisfy (1.2.15) for example at t = 0. 

In a given reference frame, it is convenient to introduce a positive scalar product 
by polarizing the normalization integral, 

(*|$) := / X* |7°$^(t,f)df . (1.2.17) 

ilR3 

We denote the Hilbert space corresponding to this scalar product hy Ti = L^(R^)^. 
Multiplying the Dirac equation (1.2.7) by 7*^ and bringing the i-derivative on a separate 
side of the equation, we can write the Dirac equation as 

idt"^ = (1.2.18) 

with a purely spatial operator h. Clearly, this equation is not manifestly covariant. In 
analogy to nonrelativistic quantum mechanics, it is referred to as the Dirac equation 
in Hamiltonian form, and h is the Harniltonian. If ^ and <I> are solutions of the Dirac 
equation, one sees similar to (1.2.16) that the scalar product (1.2.17) is independent 
of time. Hence 

= dti'i' \^) = i {{h'^ I $) - (^' I h^)) . 

This shows that the Hamiltonian is a symmetric operator on Ti.. 

We conclude this section by a brief discussion of the solutions of the free Dirac 
equation (=the Dirac equation without electromagnetic field) in the case m 7^ 0. 
Taking the Fourier transform of the wave function. 
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the Dirac equation {i^ — m)'^ = reduces to the algebraic equation in momentum 
space 

{ljl-m)^{k) = 0. (1.2.19) 

Multiplying by ^ + m and using the identity {fj( — m){lj + m) = k'^ — m^, one sees 
that if fe^ 7^ m^, the matrix ^ — m is invertible and thus (1.2.19) has no solutions. If 
conversely = m^, we have the relation {Jjl — rnf' = —2m{lf. — m), showing that the 
matrix ^ — m is diagonalizable and that its eigenvalues are either —2m or zero. Taking 
the trace, Tr(^ — m) = —4m, it follows that the matrix If, — m has a two-dimensional 
kernel. A short calculation shows that the projector onto this kernel is given by 

nW := . (1.2.20) 

We conclude that (1.2.19) has a solution only if k is on the mass shell {k\k'^ = m?}. 
For each k on the mass shell, (1.2.19) has exactly two linearly independent solutions. 
In order to give these solutions more explicitly, we choose a reference frame {t, x) and 
denote the corresponding momentum variables by k = {uj,p). The momenta on the 
mass shell are then given by 

u = Lo{p, e) := e -\/PP~+^^ 

with parameters peM.^ and e G {±1}- The momenta with e = 1 and e = — 1 are said 

to be on the upper and lower mass shell, respectively. For any given {p, e), we label the 
two linearly independent solutions of (1.2.19) by a parameter s € {1,2} and denote 
them by Xpse- It is most convenient to choose them pseudo-orthonormal with respect 
to the spin scalar product, 

■<Xpse,Xps'ey = e Ss,s' for all p G M^ e G {±1} and s, s' G {1, 2} . (1.2.21) 

Here the factor e reflects that the solution spaces on the upper and lower mass shell 
are positive and negative definite, respectively. Using a bra/kct notation in the spin 
scalar product, we get a simple representation of the projector (1.2.20), 

^IXpse^^Xpsel = U{iuip,e),p). (1.2.22) 

s=l,2 

The spinors xpse form a complete set of solutions of (1.2.19). Taking their suitably 
normalized Fourier transform, we obtain the plane wave solutions 

%~seit,S) = ®^)*+*^^-^Xp-.e. (1.2.23) 

(27r)2 

Each solution of the free Dirac equation is a superposition of plane wave solutions. 

In the Hamiltonian framework (1.2.18), the plane wave solutions are eigenfunctions 
of the Hamiltonian with eigenvalue Lo{p, e). 

Since the eigenvalue of the Hamiltonian has the interpretation as the physical energy of 
the state, we conclude that the plane wave solutions on the upper and lower mass shell 
have positive and negative energy, respectively. Expressed more mathematically, the 
plane wave solutions correspond to points in the essential spectrum of the Hamiltonian, 
and thus 

aess{h) = {±^/^~+m^, peM.^} = (-oo, -m] U [m, oo) . 
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In particular, we conclude that the Dirac equation has solutions of negative energy and 
that the Hamiltonian is not bounded below. This was originally considered a serious 
problem of Dirac theory, mainly because a system with unbounded Hamiltonian has no 
stable ground state. Dirac resolved these problems by introducing the so-called Dirac 
sea [D2] . The concept of the Dirac sea plays a crucial role in the present work. At this 
point, we merely explain Dirac's idea in words (in the next section we shall explain 
how it is implemented mathematically in the framework of quantum field theory, and 
in Chapter §2 we will come back to it in greater detail). Thinking of many-particle 
quantum mechanics (and assuming that the particles do not interact with each other) , 
the solutions of the Dirac equation can be regarded as one-particle states, which can 
be occupied by the particles of the system. According to the Pauli Exclusion Principle, 
each state may be occupied by at most one particle. If one assumes that no states 
are occupied in the vacuum, a system of n particles is unstable because the energy 
of the system can be made negative and arbitrarily small by occupying n negative- 
energy states. However, this problem disappears if one assumes that in the vacuum all 
states of negative energy are already occupied. Then the n additional particles must 
occupy states of positive energy, and the system becomes stable. This consideration led 
Dirac to no longer think of the vacuum as "empty space," but instead to conjecture 
that the vacuum should be formed of a "sea" of quantum mechanical particles of 
negative energy. Dirac's conception was that the effects of all the particles of the 
sea should compensate each other in such a way that the sea cannot be observed. 
Likewise, in this picture an interacting system of n particles corresponds to the Dirac 
sea and n additional particles of positive energy which all interact with each other. 
This intuitive concept of the Dirac sea as a "sea of interacting particles" was not only 
useful for resolving the problem of the negative-energy solutions, but furthermore led 
to the prediction of anti-particles and pair creation/annihilation. To this end, Dirac 
considered an interacting system which at initial time t = is the vacuum. Then at 
a later time, one of the particles of the sea may no longer occupy a state of negative 
energy, but be instead in a positive-energy state. In this case, the system consists of 
one particle and one "hole" in the Dirac sea. Since the completely filled Dirac sea 
should be invisible, the hole appears as a virtual particle of energy and electric charge 
opposite to that of the unoccupied negative-energy state. Thus the virtual particle 
has positive energy, but its charge is opposite to that of an ordinary particle. This 
virtual particle is referred to as anti-particle. In the above process, particles and anti- 
particles are always generated in pairs, this explains the physical effect of pair creation. 
Conversely, a particle and a hole can recombine in a process called pair annihilation. 



1.3. Fock Space Quantization of the Free Dirac Field 

In this section we outline the canonical quantization of the free Dirac field (for 
details see [BD2, IZ]). For clarity, we first quantize without taking into account 
the Dirac sea and explain afterwards how the construction is to be modified in order 
to cure the problem of the negative-energy states. We begin with the one-particle 
Hilbert space {7i, (., .)) in the Hamiltonian framework (1.2.18). Clearly, the plane- 
wave solutions (1.2.23) are not square integrable, but we can normalize them in the 
distributional sense. More precisely, 

(*p-s6 I %.'6') = 6^ip-p') <Xpse I l\ps'e'y ■ (1-3.1) 
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In order to compute the inner product -<Xpse I l^Xps't'>~-, we first plug in tlie spectral 
projectors (1.2.20), which for convenience we now denote by Ilp-g := n(a;(p, e),p), 

<Xpse I 7°Xps'e'>- = '<^peXpse I 7° '^pe'Xps' e'>- = -<Xpse I ^pe 7° ^pe'Xps'e''^ ■ 

The matrix product Iijjf:^f^^I{j^^i is computed in the cases e = e' and e 7^ e' as follows, 

Ojj^ ^ (^7° -Pl + m -a;7° - pj + m 
^ ^ 2m 2m 

m + m){-^ + m) ^0 ^ ^ 

Ojj^ ^ 0^7° - FT + m a;7° -fy + m 
2m 2m 
^^0 - + m (^ + m)(-^ + m) „ r,, , tt 

where we set lo = \uj{p,e)\ and k = {uj,p). Hence the matrix products reduce to a 
multiple of the identity, and we can use the normalization (1.2.21) to obtain 

{^pse I ^fs'e') = Mp) SHp- ^) See' Sss' , (1-3.2) 



with (jj{p) := \oo{p, e)\ = y/\'p\'^ + m^. Readers who dislike this (5-normalization can also 
state (1.3.2) by saying that, similar to a Fourier transformation, the mapping 

is an isometry of Hilbert spaces. The factor dp/(2uj(p)) appearing here can be inter- 
preted as the Lorentz invariant measure on the mass shell (i.e. formally dp/ {2lo{p)) = 
6{k^ — m^) d'^k), and we abbreviate it in what follows by d/j.^. 

In many-particle quantum mechanics, the system where the n one-particle states 
^pisiei) • • • ) ^p„sn€n occupled is described by the Hartree-Fock state 

* = A • • • A . (1.3.3) 

Here the wedge product A is the anti-symmetrized tensor product. Due to the anti- 
symmetry, the wedge product vanishes if two of the one-particle wave functions Si e; 
coincide. This corresponds to the 

Pauli Exclusion Principle: each quantum mechanical state 

can be occupied by at most one 
particle. 

Particles which obey the Pauli Exclusion Principle are called fermions (whereas for 
bosons one uses instead of (1.3.3) the symmetric tensor product). Working with the n- 
particle state (1.3.3) also implies that the n particles are indistinguishable in the sense 
that if we exchange two particles, the wave function changes only by a physically 
irrelevant phase. 

A general n-particle state corresponds to a linear combination of Hartree-Fock 
states and is thus a vector of the Hilbert space JT" = A^TC. In quantum field theory, 
the number of particles is not fixed, and therefore the Dirac particles are described 
more generally by a vector of the fermionic Fock space T = (B^^qT"'- Notice that the 
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scalar product on T is induced by that on K; we denote it for clarity by (.|.):^. On 
the Fock space, we introduce the field operators ^ft^^ by 

and denote their adjoint with respect to the scalar product by ^psn^ '^pse = 

(^t^^)*. The operators ^t^^ and ^fse are referred to as the creation and annihilation 
operators, respectively. A straightforward calculation using our definitions and the 
normalization condition (1.3.2) yields that the field operators satisfy the canonical 
anticommutation relations 

(1-3.4) 

{^pse, ^I's'e'} = Mp) SHp- See' Sss' ■ 

The vacuum corresponding to these field operators, denoted by |0>, is a unit vector 
of T on which all annihilation operators vanish, 

%se|0>=0 forallp,s,e. (1.3.5) 

The Hartree-Fock states can be obtained from it by applying the creation operators, 

(1.3.3) = ^'t • • • *t |0> , 

and taking linear combinations, we can build up the whole Fock space from the vacuum. 

The energy of the Hartree-Fock state (1.3.3) is the sum of the energies u){p,e) of all 
particles, and a short calculation shows that this coincides with the eigenvalue of the 
following operator on the Fock space, 

^0 = V / iuip,e)^^^^^sedfip. (1.3.6) 

e,s 

Thus Hq is the Hamiltonian of the free many-particle theory. 

Since the factor Lo{p,e) in the integrand can be negative, the Hamiltonian (1.3.6) 
is not bounded from below. This is precisely the problem of the negative-energy so- 
lutions of the Dirac equation which we described at the end of the previous section. 
This problem disappears in quantum field theory as follows. According to the con- 
cept of the Dirac sea, all negative-energy states should be occupied in the vacuum. 
This is implemented here by redefining the vacuum; namely we replace (1.3.5) by the 
conditions 

l'p-,+ |0> = = ^'t_|0> forallp,s. (1.3.7) 

Since the anti-particles correspond to "holes" in the Dirac sea, we reinterpret the 
creation operators for the negative-energy states as annihilation operators and vice 
versa, i.e. we perform the formal replacements 

^ *t _ ■ (1.3.8) 

This is convenient because after the reinterpretation, the new vacuum (1.3.7) again 
satisfies the usual conditions (1.3.5). The Hamiltonian (1.3.6) transforms under the 
replacements (1.3.8) into 

^ „ J IR, „ J IR,'^ 
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The first part of this Hamiltonian is positive. Using the anti-commutation rela- 
tions (1.3.4), one sees that the second term in (1.3.9) is an infinite negative constant. 

Using the argument that adding a constant to the total energy of a system is nothing 
more than introducing a new convention for the zero point of energy measurements, 
one drops this second term and redefines the Hamiltonian by 

Ho = Y.j \<^iP^s)\%se^Psedl^p. (1.3.10) 
e,s -^^^ 

This Hamiltonian is positive and vanishes on the vacuum, giving rise to a satisfying 
physical theory. However, dropping the second summand in (1.3.9) was a problematic 
step in the construction. We postpone the discussion of this point to §2.2. 

1.4. Classical Gauge Theories 

We now briefly introduce the framework of local gauge theories (for a more detailed 
introduction see for example [Ga]). In order to avoid confusion between covariant 
derivatives V and gauge-covariant derivatives D we restrict attention to Minkowski 
space. The generalization to curved space-time will be described in connection with 

the Dirac equation in §1.5. The starting point for gauge theories is the observation 
that changing the electromagnetic potential by the gradient of a real-valued function 
A, 

A — ^ A + dk, (1.4.1) 

leaves the field tensor unchanged, 

Fjk — Fjk + djdkk — dkdjA = Fj^ . 

The equations of classical electrodynamics (1.1.2, 1.1.3) do not involve the electromag- 
netic potential, only its field tensor. Therefore, these equations are obviously invariant 
under the transformation (1.4.1). In the quantum mechanical wave equations (1.2.2, 
1.2.7) the electromagnetic potential does appear, but only in combination with a par- 
tial derivative in the operators dk — iA^. These operators transform under (1.4.1) as 
follows, 

dk-iAk — > dk-iAk-idkA = e'^ {dk - lAk) e''^ . 
Writing the transformation law with the multiplication operators e"^^^ reveals that the 
equations of quantum mechanics are invariant under (1.4.1) if at the same time the 
local phase of the wave functions is transformed according to 

^ — > e*^^'. (1.4.2) 

Finally, these local phase transformations leave the Dirac current (1.2.9) unchanged. 
We conclude that classical field theory and relativistic quantum mechanics are in- 
variant under the transformation (1.4.1, 1.4.2), which is referred to as a local gauge 
transformxition of electrodynamics. The invariance of the physical equations under 
local gauge transformations can be interpreted as a physical symmetry, the local gauge 
symmetry. 

Extending the above concept leads to the mathematical framework of gauge theo- 
ries. We first note that the phase factor e*^ in (1.4.2) can be interpreted as the opera- 
tion of an element of the Lie group U (1) on Likewise, the factors djk = ie'^^dje~^^ 
can be regarded as elements of the corresponding Lie algebra u(l). Since in (1.4.1) 
this factor is added to the components Aj of the electromagnetic potential, it is nat- 
ural to also consider the Aj as «(l)-valued functions. In generalization, we let the 
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gauge group Q be an arbitrary Lie group in a given matrix representation on the wave 
functions (the wave functions may have more than four components; a typical example 
is ^ = U{p) and ^'(.t) G ® C^). The corresponding Lie algebra in its representa- 
tion on the wave functions is denoted by g. We introduce the gauge potentials Aj as 
g- valued functions on M. For any smooth function U : M ^ Q, the transformation 
of the wave functions 

^(x) — > U{x)^{x) (1.4.3) 

is referred to as a local gauge transformation. Clearly, partial derivatives of ^ do 
not behave well under gauge transformations because we pick up derivatives of U. 
This problem disappears if instead of partial derivatives we consider gauge-covariant 
derivatives 

Dj = d, - tA, , (L4.4) 

provided that the gauge potentials transform according to 

Aj — > UAjU'^ + iU {djU'^) . (1.4.5) 

Namely, a short calculation shows that the gauge-covariant derivative behaves under 
gauge transformations according to 

Dj — ^ UDjU'^, (1.4.6) 

and thus the gauge-covariant derivatives of obey the simple transformation rule 

Dj^ — > UDj^. 

Next we need to introduce the gauge potentials into the physical equations and 
formulate the equations that describe the dynamics of the gauge fields. Wc just saw 
that in order to ensure gauge invar iance, one should work with gauge-invariant deriva- 
tives instead of partial derivatives. The simplest method for making the physical 
theory gauge invariant is to replace all partial derivatives by the corresponding gauge- 
invariant derivatives, 

d — > D. (1.4.7) 

This ad-hoc method is in physics called the minimal coupling procedure. For the 
equations of quantum mechanics it can be motivated if one keeps in mind that with 
a local gauge transformation of the form U{x) = 1 — iAj (x — py + o{x — p) we can 
always arrange that A[p) = 0. In this gauge, the gauge-covariant derivatives coincide 
at p with the partial derivatives, and thus we can state minimal coupling as follows. 

Around each space-time point p there is a gauge such that the 

quantum mechanical equations coincide at p with the equations (1.4.8) 

without gauge fields. 

In this formulation, minimal coupling can be understood similar to the strong equiva- 
lence principle; we only need to replace "coordinate system" by "gauge" and "gravita- 
tional field" by "gauge field." In the example of the free Dirac equation (i^— m)^' = 0, 
minimal coupling yields the equation 

il^idj -lAj)-^ = , 

which describes a behavior of a Dirac particle in the presence of the gauge field. This 
equation can also be derived by varying in the corresponding Dirac action 

Sj) = / I [i^^idj - iAj) - m) dfi. 
Jm 
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In order to get the equations for the gauge field, we construct out of the gauge-covariant 
derivative the field tensor by 

Fjk = i [Dj,Dk] = djAk - dkAj - i[Aj,Ak] . 

Since its behavior under gauge transformation is simply 

Fjk — U Fjk , 

we can generalize the action of the electromagnetic field in (1.1.9) by the Yang-Mills 
action 

where "Tr" is a suitably normalized matrix trace. The total action is simply the sum 
of the Dirac and Yang-Mills actions, 

S = Sb + Sym ■ 

Varying A) we obtain the coupled Dirac- Yang/Mills equations which describe the 
classical dynamics. 

1.5. Dirac Spinors in Curved Space-Time 

Dirac spinors arc often formulated on a manifold using frame bundles, cither an 
orthonormal frame [B, Fr] or a Newman-Penrose null frame [PR, Ch]. We here 
outline an equivalent formulation of spinors in curved space-time in the framework 
of a U{2,2) gauge theory (for details see [F2]). We restrict attention to the Dirac 
operator in local coordinates; for global issues like topological obstructions for the 
existence of spin structures see e.g. [LM]. We let M be a 4-dimensional manifold 
(without Lorentz metric) and define the spinor bundle SM as a vector bundle over M 
with fibre C^. The fibres are endowed with a scalar product -<;.|.;^ of signature (2,2), 
which is again referred to as the spin scalar product. Sections in the spinor bundle 
are called spinors or wave functions. In local coordinates, a spinor is represented by a 
4-component complex function on space-time, usually denoted by ^{x). Choosing at 
every space-time point a pseudo-orthonormal basis {eot)a=i,...,A in the fibres, 

-<e-o\eQ>- = Sa Sal3 , Si = S2 = l, S3 = S4 = -1 (1.5.1) 

and representing the spinors in this basis, ^ = ^'^Cq,, the spin scalar product takes 
again the form (1.2.8). Clearly, the basis (e^) is not unique, but at every space-point 
can be transformed according to 

where U is an isometry of the spin scalar product, U G U(2,2). Under this basis 
transformation the spinors behave as follows, 

^'"(x) — > U^{x)^'^{x). (1.5.2) 

Due to the analogy to (1.4.3) we interpret this transformation of the wave functions 
as a local gauge transformation with gauge group Q = U{2, 2). We refer to a choice of 
the spinor basis (cq,) as a gauge. 

Our goal is to formulate classical Dirac theory in such a way that the above U (2, 2) 

gauge transformations correspond to a physical symmetry, the U (2, 2) gauge symmetry 
. To this end, we shall introduce the Dirac operator as the basic object on M, from 
which we will later deduce the Lorentz metric and the gauge potentials. We define a 
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differential operator T> of first order on the wave functions by the requirement that in 
a chart and gauge it should be of the form 

V = iGHx)^ + B{x) (1.5.3) 

with suitable (4x4)-matrices and B. This definition does not depend on coordinates 
and gauge, although the form of the matrices and B clearly does. More precisely, 
under a change of coordinates x^ the operator (1.5.3) transforms into 

whereas a gauge transformation * — > yields the operator 

UVU-'^ = i [UGW-^) ^ + {UBU-^ + iUG^{djU-'^)) . (1.5.5) 

We define the Dirac operator by the requirement that by choosing suitable coordinates 
and gauge, one can arrange that the matrices G-' in front of the partial derivatives 
"coincide locally" with the Dirac matrices of Minkowski space. 

Def. 1.5.1. A differential operator D of first order is called Dirac operator if 
for every p £ M there is a chart (x*, U) around p and a gauge (eQ,)a=i,...,4 such that V 
is of the form (1.5.3) with 

Gi(p) = y , (1.5.6) 

where the are the Dirac matrices of Minkowski space in the Dirac representa- 
tion (1.2.5). 

It may seem unconventional that in this definition the zero order term B of the 
Dirac operator is not at all specified. Furthermore, our formulation as a gauge theory 
seems incomplete because we introduced local gauge transformations (1.5.2, 1.4.3), but 
not a corresponding gaugc-covariant derivative (1.4.4). In order to clarify the situation, 
we shall now construct from the Dirac operator a gauge-covariant derivative D, also 
referred to as spin derivative. To this end, we must find matrices Aj which transform 
under local gauge transformations according to (1.4.5). This construction will also 
reveal the structure of the matrix B, and this will finally lead us to the definition of 
the so-called physical Dirac operator, which involves precisely the gravitational and 
electromagnetic fields. 

In the chart and gauge where (1.5.6) holds, it is obvious from (1.2.3) that the 
anti-commutator of the matrices G^{p) gives the Minkowski metric. Using the trans- 
formation rules (1.5.4, 1.5.5), one sees that in a general coordinate system and gauge, 
their anti-commutator defines a Lorentz metric, 

g^\x)l = \ {G\x), G\x)}. (1.5.7) 

In this way, the Dirac operator induces on the manifold a Lorentzian structure. We 
refer to the matrices G^ as the Dirac matrices in curved space-time. Since we can 
arrange that these matrices coincide locally with the Dirac matrices of Minkowski 

space, all manipulations of Dirac matrices can be performed at any given space-time 
point in an obvious way. In particular, the pseudoscalar matrix (1.2.13) now takes the 
more general form 

P{^) = ^sjkimG^G'^G^G^, 
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where the anti-symmetric tensor Ej^im differs from the anti-symmetric symbol ej^im by 
the volume density, Sjkim = ^/\detg\€jklm■ The pseudoscalar matrix gives us again the 
notion of even and odd matrices and of chirality (1.2.14). Furthermore, we introduce 

the bilinear matrices a^^ by 

As in Minkowski space, the matrices 

, pG^ , 1 , ip, a^^ (1.5.8) 

form a basis of the 16-dimensional (real) vector space of selfadjoint matrices (with 
respect to -<.|.>-). The matrices G^ and pG^ are odd, whereas 1, ip and a^^ are even. 

For the construction of the spin connection we must clearly consider derivatives. 
The Lorentzian metric (1.5.7) induces the Levi-Civita connection V, which defines 
the covariant derivative of tensor fields. Taking covariant derivatives of the Dirac 
matrices, VkG^ = dkG^ + F^^ G\ we obtain an expression which behaves under co- 
ordinate transformations like a tensor. However, it is not gauge covariant, because a 
gauge transformation (1.4.3) yields contributions involving first derivatives of U. More 
precisely, according to (1.5.5), 

VkG^ VkiUG^U-') = U{VkG^)U-' + {dkU)GW-' + UG^dkU-^) 

= U{VkG^)U-^ - [U{dkU-^), UGW-^] . (1.5.9) 

We can use the second summand in (1.5.9) to partially fix the gauge. 

Lemma 1.5.2. For every space-time point p G M there is a gauge such that 

VkG\p) = (1.5.10) 

(for all indices j, k). 

Proof. We start with an arbitrary gauge and construct the desired gauge with 
two subsequent gauge transformations: 
(1) The matrix djp is odd, because 

= djl = dj{pp) = {djp)p + p{djp) . 

As a consequence, the matrix ip{djp) is selfadjoint. We can thus perform a 
gauge transformation U with U{p) = 1, djU{p) = \p{djp). In the new gauge 
the matrix djp(p) vanishes, 

9jP\p — ' dj{UpU'^)\p = djp\p + ]^[p{djp), = djp\p- p^{djp)\p = 0. 

Differentiating the relation {p, G-^ } = 0, one sees that the matrix '^kGj^ is 
odd. We can thus represent it in the form 

V.G; = ALG|^ + eLpG- (1.5.11) 

with suitable coefficients A^^ and Qj,^- 

This representation can be further simplified: According to Ricci's Lemma, 
Vrifi'-'^ = 0. Expressing the metric via the anti-commutation relations and dif- 
ferentiating through with the Leibniz rule, we obtain 

= {V„G^ G"} + V„G'^} 

= 2A^„„r' - &im^iP<^"'' + 2AL5™^' - 0L2W™^' (1.5.12) 
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and thus 

Ki^g^^ = -AL^^'- (1-5.13) 
In the case j = k m, (1.5.12) yields that Qiim = 0. For j ^ k, we obtain 

®nj ^^'^ + ®nfc ~ ^^'^ ®nj ~ ®nfc ^^"^ ^ denote fixed indices, no 
summation is performed). We conclude that there are coefficients 0^ with 

©L = (^i^^L- (1-5.14) 
(2) We perform a gauge transformation U with U{p) = 1 and 

dkU = —e^p - \Ktn9'''^ml. 

Using the representation (1.5.11) together with (1.5.13, 1.5.14), the matrix 
VfeG'' transforms into 

= + e, pG^ - G, pG^ - J AZ 5"' [c^mi, G^] 

= Km -\ ^tn 5"' 2i {Gm sj - Gi Si,) 

= KmG"^ + l^tnG'^'G^ - ^ALg- = 0. ■ 



We call a gauge satisfying condition (1.5.10) a norm,al gauge around p. In order to 
analyze the remaining gauge freedom, wc let [/ be a transformation between two normal 
gauges. Then according to (1.5.9) and (1.5.10), the commutator \U{dkU~^), UG^U~^] 
vanishes at p or, equivalently, 

[iidkU-')U, G% = 0. 

As is easily verified in the basis (1.5.8) using the commutation relations between 

the Dirac matrices, a matrix which commutes with all Dirac matrices is a multi- 
ple of the identity matrix. Moreover, the matrix i{djU^^) U is selfadjoint because 
iiidjU-^)U)* = -iU-^{djU) = -idj{U~^U)+i{djU-^)U = i{djU-^)U. We conclude 
that the matrix i{djU~^) U is & real multiple of the identity matrix, and transforming 
it unitarily with U we see that it also coincides with the matrix iU {djU~^). Under 
this strong constraint for the gauge transformation it is easy to find expressions with 
the required behavior (1.4.5) under gauge transformations. Namely, setting 

aj = ^ Re Tr(Gj B) 1 , (1.5.15) 

where "Tr" denotes the trace of a 4 x 4-matrix, one sees from (1.5.5) that 

aj — ^ + ^ Re TV (GjG'' iidkU'^) C/) 1 = aj + iU{djU-^) . 

We can identify the aj with the gauge potentials Aj and use (1.4.4) as the definition 
of the spin connection. 

Def. 1.5.3. The spin derivative D is defined by the condition that it behaves 
under gauge transformations (1.4-3) according to (1.4-6) and in normal gauges around 
p has the form 
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with the potentials aj according to (1.5.15). 

In general gauges, the spin derivative can be written as 

^3 = ^ - iEj - iaj (1.5.17) 

with additional matrices Ej (x) , which involve the Dirac matrices and their first deriva- 
tives. A short calculation shows that the trace of the matrix Ej does not change under 
gauge transformations, and since it vanishes in normal gauges, we conclude that the 
matrices Ej are trace-free. A straightforward calculation yields that they are explicitly 
given by 

In the next two theorems wc collect the basic properties of the spin connection. 

Theorem 1.5.4. The spin derivative satisfies for all wave functions ^ , $ the equa- 
tions 

[Dk,&] + Ti^G' = (1.5.18) 
dj I = ^Dj^ I + I Dj^^ . (1.5.19) 

Proof. The left side of (1.5.18) behaves under gauge transformations according 
to the adjoint representation . ^ U . U"^ of the gauge group. Thus it suffices to check 
(1.5.18) in a normal gauge, where 

[Dk,&] + riG^ = - '-ReTh:{GjB)[l,&] = 0. 

Since both sides of (1.5.19) are gauge invariant, it again suffices to consider a 
normal gauge. The statement is then an immediate consequence of the Leibniz rule 
for partial derivatives and the fact that the spin derivative differs from the partial 
derivative by an imaginary multiple of the identity matrix (1.5.16). ■ 

The identity (1.5.18) means that the coordinate and gauge invariant derivative of 
the Dirac matrices vanishes. The relation (1.5.19) shows that the spin connection is 
compatible with the spin scalar product. 

We define torsion T and curvature TZ of the spin connection as the following 2- 
forms, 

Theorem 1.5.5. The spin connection is torsion-free. Curvature has the form 

T^jk = \ Rmnjk tr'"" + ^ {djOk - dkOj) (1.5.20) 

where Rmnjk is the the Riemannian curvature tensor and the aj are given by (1.5.15). 
Proof The identity (1.5.18) yields that 

[Dj, Gk] = [Dj, g^iG'] = {djgki)G' - gki^'j^G^ = Tf.Gm 
and thus, using that the Levi-Civita connection is torsion-free, 

= 2 ''^Tk ~ ^kj) Gm = 0- 
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Again using (1.5.18), we can rewrite the covariant derivative as a spin derivative, 

Gi Vku' = [Dk, Giu'] . 
Iterating this relation, we can express the Riemann tensor (1.1.7) by 
Gi R]^iu^ = [Dj, [Dk, Giu']] - [Dk, [D„ Giu']] 

= [[Dj, Dk], Giu^] = -2i [n,k, Giu^] . 
This equation determines curvature up to a multiple of the identity matrix, 

'R-jk{x) = - Rmnjk + ^jk^ ■ 

Thus it remains to compute the trace of curvature, 

J TviUjk) 1 = 1 TridjAk - dkAj) 1 = ^ {djak - d^aj) , 
where we used (1.5.17) and the fact that the matrices Ej are trace- free. ■ 

We come to the physical interpretation of the above construction. According 
to Lemma 1.5.2 we can choose a gauge around p such that the covariant deriva- 
tives of the Dirac matrices vanish at p. Moreover, choosing normal coordinates and 
making a global (=constant) gauge transformation, we can arrange that G{p) = 7-^ 
and djgki{p) = 0. Then the covariant derivatives at p reduce to partial derivatives, 
and we conclude that 

G\p) = 7^ dk&{p) = 0. (1.5.21) 

These equations have a large similarity with the conditions for normal coordinates (1.1.4), 
only the role of the metric is now played by the Dirac matrices. Indeed, differenti- 
ating (1.5.7) one sees that (1.5.21) implies (1.1.4). Therefore, (1.5.21) is a stronger 
condition which not only gives a constraint for the coordinates, but also for the gauge. 
We call a coordinate system and gauge where (1.5.21) is satisfied a normal reference 
frame around p. 

In a normal reference frame, the Dirac matrices, and via (1.5.7) also the metric, 
are the same as in Minkowski space up to the order o{x — p). According to the strong 
equivalence principle, the Dirac equation at p should coincide with that in Minkowski 
space. Now we use minimal coupling in the formulation (1.4.8) to conclude that there 
should be a normal gauge such that all gauge potentials vanish at p, and thus the Dirac 
operator at p should coincide with the free Dirac operator i^. This physical argument 
allows us to specify the zero order term in (1.5.3). 

Def. 1.5.6. A Dirac operator D is called physical Dirac operator if for any 

p & M there is a normal reference frame around p such that B{p) = 0. 

Equivalently, the physical Dirac operator could be defined as a differential operator 
of first order (1.5.3) with the additional structure that for any p e M there is a 
coordinate chart and gauge such that the following three conditions are satisfied, 

&{p)=jj, dk&{p) = 0, Bip) = 0. 

This alternative definition has the disadvantage that it is a-priori not clear whether the 
second condition d^G^p) = can be satisfied for a general metric. This is the reason 
why we preferred to begin with only the first condition (Def. 1.5.1), then showed that 
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the second condition can be arranged by choosing suitable coordinates and gauge, and 
satisfied the third condition at the end (Def. 1.5.6). 

In general coordinates and gauge, the physical Dirac operator can be written as 

V = iGWj = iG^ {dj - iEj - iaj) , 

where D is the spin connection of Def. 1.5.3. The matrices Ej take into account the 
gravitational field and are called spin coefficients, whereas the aj can be identified 
with the electromagnetic potential (compare (1.2.7)). We point out that the gravita- 
tional field cannot be introduced into the Dirac equation by the simple replacement 
rule (1.4.7) because gravity has an effect on both the Dirac matrices and the spin co- 
efficients. But factorizing the gauge group as U{2, 2) = U{1) x SU{2, 2), the SU{2, 2)- 
gaugc transformations arc linked to the gravitational field because they infiuence 
and Ej, whereas the U{1) can be identified with the gauge group of electrodynamics. 
In this sense, we obtain a unified description of electrodynamics and general relativity 
as a U{2, 2) gauge theory. The Dirac equation 

(P-m)* = 

describes a Dirac particle in the gravitational and electromagnetic field. According 
to Theorem 1.5.5, the curvature of the spin connection involves both the Ricmann 
tensor and the electromagnetic field tensor. We can write down the classical action in 
terms of these tensor fields, and variation yields the classical Einstein-Dirac-Maxwell 
equations. 

For the probabilistic interpretation of the Dirac equation in curved space-time, 
we choose a space-like hypersurface 7i (corresponding to "space" for some observer) 
and consider in generalization of (1.2.17) on solutions of the Dirac equation the scalar 
product 

(*!$)= / ^^{G^i^j^ydun, (1.5.22) 

where v is the future-directed normal on H and dfx-H is the invariant measure on the 
Riemannian manifold TC. Then (^'|^) is the normalization integral, which we again 
normalize to one. Its integrand has the interpretation as the probability density. In 
analogy to (1.2.9) the Dirac current is introduced by J'^ = | G'^^y. Using Theo- 
rem 1.5.4 one sees similar as in Minkowski space that the Dirac current is divergence- 
free, Vfcj'^ = 0. From Gauss' theorem one obtains that the scalar product (1.5.22) 
does not depend on the choice of the hypersurface H. 

We finally remark that using Theorem 1.5.4 together with Gauss' theorem, one 
easily verifies that the physical Dirac operator is Hermitian with respect to the inner 
product 

<^|$>:= / ^^\<^ydn, (1.5.23) 
Jm 

in which the wave functions (which need not satisfy the Dirac equation but must have a 
suitable decay at infinity) are integrated over the whole space-time. This inner product 
is not positive, but it will nevertheless play an important conceptual role in the next 
chapters. 
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The Fermionic Projector in the Continuum 

In the previous chapter we introduced the concept of the Dirac sea in order to 
give the negative-energy solutions of the free Dirac equation a physical meaning as 
anti-particle states (see §1.2, §1.3). Now we shall extend this concept to the case with 
interaction. We will see that the Dirac sea can still be introduced as a universal object 
in space-time, described mathematically by the so-called fermionic projector (§2.3). We 
develop the mathematical methods for an explicit analysis of the fermionic projector 
in position space (§2.5) and finally consider the normalization of the fermionic states 
(§2.6). 

2.1. The External Field Problem 

We begin with the simplest interaction: a classical external field in Minkowski 
space. In this situation the Dirac wave function is a solution of the Dirac equation 

[i^ + B-m)^ = 0, (2.1.1) 

where the operator B is composed of the external potentials (as an example one may 
choose B = with A the electromagnetic potential (1.2.7)). In order to have current 
conservation (i.e. the identity dkJ'^ = with J according to (1.2.9)), we always assume 
that B is Hermitian (with respect to the spin scalar product). If B is static (=time 
independent), we can separate the time dependence of the wave function with a plane 
wave ansatz, 

^(t,x) = e-^'^* V(^) • (2.1.2) 
The separation constant oj has the interpretation as the energy of the solution. Thus 
the energy is a conserved quantity, and its sign distinguishes between solutions of 
positive and negative energy. In more mathematical terms, for a static potential the 
Hamiltonian in (1.2.18) is time independent, and the sign of the spectrum of h gives 
a splitting of the solution space of the Dirac equation into the subspaces of positive 
and negative energy, respectively. As a consequence, our previous construction of the 
Dirac sea can be adapted: When building up the Fock space from the one-particle 
states, we cure the problem of the negative-energy solutions similar to (1.3.7-1.3.9) 
by redefining the vacuum and by formally exchanging the creation and annihilation 
operators corresponding to the negative-energy solutions. 

The situation becomes much more difficult when B is time- dependent. In this case, 
the separation ansatz (2.1.2) no longer works. The energy is not conserved, and it is 
even possible that a solution which has positive energy at initial time will have negative 
energy at a later time. Expressed more mathematically, the Hamiltonian in (1.2.18) 
now depends explicitly on time, and therefore the sign of the spectrum of h no longer 
gives a canonical splitting of the solution space of the Dirac equation (this splitting 
would also depend on time). As a consequence, it is not clear which solutions have 
the interpretation as "negative-energy solutions" and thus correspond to anti-particle 

29 



30 



2. THE FERMIONIC PROJECTOR IN THE CONTINUUM 



states (1.3.8). For this reason, it is no longer obvious how to quantize the Dirac field 
in a canonical way. This difficulty is usually referred to as the external field problem. 

It becomes most evident in the setting of Klein's paradox, where one considers a step 
potential whose amplitude is larger than the mass gap (see [BDl, T]). However, we 
point out that the external field problem appears already for arbitrarily weak external 
fields, simply because the time dependence of B leads to a complicated mixing of the 
solutions of positive and negative energy. We could speak of a "solution of negative 
energy" only if it were a superposition of states which all had negative energy, and 
there seems no reason why such solutions should exist. 

It is instructive to discuss the external field problem in the setting of pertubation 
theory. Consider a first order perturbation of the plane-wave solution "^psej 

^ = *pse + A* + 0(^2) (2.1.3) 

Substituting this ansatz into the Dirac equation (2.1.1), we obtain to first order in B 
the inhomogeneous Dirac equation 

{i^ - m) Al- = -B l-p-se • (2.1.4) 

If Sm{x,y) is a Green's function of the free Dirac equation, characterized by the dis- 
tributional equation 

(i^^-m) Sm{x,y) = S^{x-y), (2.1.5) 
we can construct a solution of (2.1.4) by 

= - j d^ys{x,y)B{y)^pse{y) (2.1.6) 

(in order not to distract from the main ideas, we here calculate on a formal level; the 
analytic justification will be given at the end of §2.2). If the Green's function were 
unique, (2.1.3, 2.1.6) would give a unique procedure for perturbing the negative-energy 
solutions of the vacuum, making it possible to extend the notion of "negative-energy 
state" to the interacting theory (at least in first order perturbation theory). 

The problem is that the Green's function is not unique., as we now briefiy recall 
(for details see [BDl]). Taking the Fourier transform of (2.1.5), 

sm{x,y) = J ^Sm{k)e-^'(^-y\ (2.1.7) 

we obtain the algebraic equation 

{}^-m)sm{k) = 1. 

Since the matrix ^ — m is singular on the mass shell (see the argument after (1.2.19)), 
this equation can be solved for Smik) only after using a ztie-regularization on the mass 
shell. The most popular choices are the advanced and the retarded Green's functions 
defined by 

Sm{k) = lim To — ^ "I ■ in and s!l(k) = lim — ^ ^ . , n ) (2.1.8) 

respectively (with the limit e \ taken in the distributional sense). Computing their 
Fourier transform (2.1.7) with residues, one sees that they are causal'm. the sense that 
their supports lie in the upper and lower light cone, respectively, 

supps^(x, .) C , supps^(x, .) C . (2.1.9) 
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Another commmon choice is the Feynman propagator 

Taking the Fourier transform (2.1.7) with residues, one finds 

= T?^ f m + m)er^^^^-y) d/x,- (2.1.11) 

with t = {y — x)^ and Lo{p), d^j^ as introduced after (1.3.2). Here e denotes the step 
function e(x) = 1 for x > and e{x) = — 1 otherwise. Thus for positive t we get the 
integral over the upper mass shell, whereas for negative t we integrate over the lower 
mass shell. As a consequence, the Feynman propagator is not causal, but it is instead 
characterized by the frequency conditions that it is for positive and negative time t 
composed only of positive and negative frequencies, respectively. More systematically, 
the defining equation for the Green's function (2.1.5) determines Sm only up to a 
solution of the homogeneous Dirac equation. Thus we can write a general Green's 
function Sm in the form 

Sni{x,y) = Sm(a;,y) + a{x,y) , (2.1.12) 
where a(x, y) is a linear combination of plane-wave solutions in the variable x, i.e. 



'IR- 

with a suitable complex- valued function cpse{y)- 

Due to the non-uniqueness of the Green's function (2.1.12), the relations (2.1.3, 
2.1.6) do not give a unique procedure for perturbing the plane- wave solutions 
In particular, it is not clear how to extend the notion of "negative energy state" to 
the interacting theory. This corresponds precisely to the external field problem. We 
conclude that in a perturbative approach, the external field problem becomes manifest 
in the non-uniqueness of the perturbation expansion. 

Feynman [Fe] gave the frequency conditions in the Feynman propagator the phys- 
ical interpretation as "positive-energy states moving to the future" and "negative- 
energy states moving to the past". Identifying "negative-energy states moving to the 
past" with "antiparticle states" he concluded that the Feynman propagator is distin- 
guished from all other Green's functions in that it takes into account the particle/anti- 
particle interpretation of the Dirac equation in the physically correct way. Feyn- 
man proposed to perform the perturbation expansion exclusively with the Feynman 
propagator, thereby making the perturbation expansion unique. The flaw is that the 
frequency conditions are not invariant under general coordinate and gauge transfor- 
mations (simply because such transformations "mix" positive and negative frequen- 
cies), and therefore Feynman's method is not compatible with the equivalence prin- 
ciple and the local gauge principle. This is not a problem for most calculations in 
physics, but it is not satisfying conceptually. Another approach is to work with the 
so-called Hadamard states [H, Wa2]. The disadvantage of this approach is that the 
states of the quantum field no longer have a particle interpretation. In other words, the 
notions of "particle" and "anti-particle" depend on the local observer, and therefore 
also the notion of the Dirac sea loses its universal meaning. We proceed in §2.2 by 
showing that the Dirac sea can indeed be introduced as a global object of space-time, 
even in the presence of a general interaction. 
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We finally remark that the above arguments apply in the same way for second 
quantized fields: we only need to replace B by an operator on a suitable bosonic Fock 
space. Also, our assumption that B is an external field is merely a technical simpli- 
fication (more precisely, we disregard the dynamical equations for the bosonic fields, 
thereby also avoiding the divergences of QFT and the renormalization procedure) , but 
it is not essential for our arguments. Namely, in a time-dependent interacting system, 
the Dirac wave functions satisfy (2.1.1), where B{t,x) is determined by the dynamical 
equations of the whole system. Solving these equations we can (at least in principle) 
compute B, and applying our above arguments with this B as an external field, we 
conclude that the notion of "negative-energy state" ceases to exist. In what follows we 
will for simplicity again consider an external field, but we shall come back to coupled 
systems in §2.4. 



2.2. The Causal Perturbation Expansion 

We saw in the previous section that the external field problem for the Dirac equa- 
tion (2.1.1) is equivalent to the non-uniqueness of the perturbation expansion for the 
individual states of the Dirac sea. We shall now solve this problem by considering the 
collection of all states of the Dirac sea. This will reveal an underlying causal structure, 
which will enable us to make the perturbation expansion unique^. We closely follow 
the constructions given in [F4]. 

In the vacuum, out of all plane-wave solutions of negative energy we form the 
object 

P^'^-(x,y) := / \^ps-^^^ps-\diip. (2.2.1) 

Using the explicit form of the plane-wave solutions (1.2.23, 1.2.22, 1.2.20), we obtain 
the covariant formula 

P^^'^{x, y) = [ 7^ (-^ + m) 6{k^ - m^) Q{-hP) £-^'=(^-2/) , (2.2.2) 
J (27r)* 

which also shows that P^^^{x,y) is a well-defined distribution. In order to get a con- 
nection to causality, we decompose P^^'^{x,y) in the form 

P'''"{x,y) = l{Pmix,y) - kmix,y)), (2.2.3) 
where Pm and km are the distributions 

m^) e-^'=(="-2') (2.2.4) 

m^) e{k^) e-^'=(^-2') . (2.2.5) 



Pm{x,y) 



(2^ 
;4 



{1^ + m) 5{k^ 



/d k 
— (^ + m) d{k'^ - 



We remark for clarity that our "causal perturbation expansion" docs not seem to be related to 
Scharf's "causal approach" to QED [S]. Scharf uses causality to avoid the ultraviolet divergences of 
perturbative QED, whereas in our setting of an external field all Feynman diagrams are finite anyway. 
On the other hand, Scharf is interested only in the scattering states, whereas our goal is to describe 
the dynamics also for intermediate times. 
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In order to relate the distribution km to the advanced and retarded Green's functions, 
we substitute the distributional equation 

/I 1 \ 

lim — = 27ri 6(x) 

e\o \x — ie X + i£ J 

into the formula for km in momentum space, 
kmip) = + m) - m^) e(/) 
= (-d + m) lim 

= {4 + m) lim 

27ri e\0 

Using (2.1.8) we get the simple formula 



p2 _ ^2 _ ^g- p2 _ ^2 _|_ ^g- 



p2 _ ^2 _ ^gpO ^2 _ j^2 _|_ ^g-pO ^ 



(2.2.6) 



= ^ [sl, - s^^) . (2.2.7) 
The support property (2.1.9) yields that km is causal in the sense that 

suppA;„i(x,.) C Jx ■ 

The distribution pm is not causal, but it can be deduced from km as follows. For a 
diagonalizable matrix A with real eigenvalues we can uniquely define its absolute value 
\A\ as the diagonalizable matrix with non- negative eigenvalues and \A\'^ = . The 
matrix ^ + m in (2.2.4, 2.2.5) is diagonalizable with non-negative eigenvalues and thus 

|e(/c°) (^ + m)| = li + m. (2.2.8) 

Before applying this relation to {2.2 A, 2.2.5), it is useful to consider the above distribu- 
tions P^°^, ... as integral kernels of corresponding operators on the wave functions 
in space-time, for example 

(P--M/)(x) := J P''^{x,y)^{y)d'y. (2.2.9) 

Then the operators km and Pm are diagonal in momentum space, and so (2.2.8) gives 
rise to the formal identity 

= \km\ , (2.2.10) 

where | . | now is the absolute value of an operator on wave functions in Minkowski 
space. With (2.2.3) and (2.2.7, 2.2.10) we have related the fermionic projector to the 
causal Green's functions in a way which can be generalized to the interacting theory, 
as we shall now make precise. 

We begin with the perturbation expansion for the causal Green's functions. The 
retarded Green's function in the presence of the external field B, denoted by s^, is 
characterized by the conditions 

{i^ + B-m)s^ix,y) = S\x-y) , snpps^{x, .) C . (2.2.11) 



The existence and uniqueness of the advanced Green's functions follows from the gen- 
eral theory of linear hyperbolic PDEs [J, Ta]. In short, for the existence proof one 
considers the solution of the Cauchy problem 



(i^ + ^ - m) ^' = / G C^{{to, oo) X M3)4 ^ ^(-^Q^ x) = ; 
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by linearity it can be expressed as an integral over the inhomogeneity, 



7ir4 



To prove uniqueness, one considers the difference of two retarded Green's functions ^ 
and 2> 

Then ^(x) is for fixed y a solution of the homogeneous Dirac equation which vanishes 
identically on the half space < The uniqueness of the solution of the Cauchy 
problem yields that ^' = 0. 

Expanding (2.2.11) in powers of B, one obtains the perturbation series = 
X^^O^(rt)' '^liere s^q^ = is the advanced Green's function of the vacuum, and the 
other summands are determined by the conditions that they are causal, supp (x, ■) € 
J^, and satisfy the inductive relations 

(i^-m)4) = -^4,_i) (n>l). 

Here we again used the operator notation (2.2.9) and considered ;B as a multiplication 
operator. The operator product 

{-s^Bs^){x,y) = - J d'zs':^{x,z)B{z)sUz,y) (2.2.12) 

is causal in the sense that B{z) enters only for z e L{^nLy (the analytic justification of 
this and all other operator products in this section will be given in Lemma 2.2.2 below). 
In particular, the support of (2.2.12) is again in the past light cone. Furthermore, it 
satisfies the relation 

(i^_^)(_,A ^^A) = 

and can thus be identified with the operator s^^^. By iteration, we obtain for the other 
terms of the perturbation series the explicit formulas 



We conclude that the retarded Green's function can be represented as 

fc=0 



4 = EH-^)'^-- (2-2-13) 



Similarly, we introduce the advanced Green's function by the conditions 

{i^-m + B)sl{x,y) = 5\x - y) , supps;;(a;, .) C J^V (2.2.14) 

It has the perturbation expansion 

oo 

~V \ ^ / y -in\k 

k=0 



Ei-'l^)''"m- (2.2.15) 



Having uniquely introduced the causal Green's functions, we can now extend (2.2.7) 
to the case with interaction. Namely, we define the operator km by 

= ^. {sl - S:,) (2.2.16) 
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with the causal Green's functions as given by (2.2.13, 2.2.15). Finally, we also ex- 
tend (2.2.10) to the case with interaction by setting 

\/^. (2.2.17) 

In the next theorem we will give this last relation a precise mathematical meaning and 
show that it gives rise to a unique perturbation expansion. It is most convenient to 
work with the Green's function 

1 _ 



■■= + (2.2.18) 



2 

Furthermore, we introduce the series of operator products 

oo oo oo 

= Y^^-SmBf , hm = Y.{-B Smf B , b> = J^i'B Sm) 

k=0 k=0 k=0 

and set for Q C N 

Pm if n G Q 



Fm{Q,n) 



km iin ^ Q 



Theorem 2.2.1. The relations (2.2.16, 2.2.17) uniquely determine the perturbation 
expansions for and p^ ■ We have the explicit formulas 

oo 

km = Y.^-i7rf^b<km{bmkmf^b> (2.2.19) 

/3=0 



oo L 2 J 



= Y.Y^c{a,(3)Gm{a,P) (2.2.20) 

13=0 a=0 

with the coefficients 

c(0,0) = 1 (2.2.21) 

c(a,m = f;(-ir«2^(^:«:;) ,orp>l (2.2.22) 

n=a+l ' ^ ^ 

and the operator products 

Gm{a,l3) = J2 (-^^)''' 

QeP(/3+l), #Q=2a+l 
^ FfniQi 1) bmkmbm ^miQi 2) bmkmbm ' ' ' ^mkm 

bmFm{Q,P+l)b>, (2.2.23) 

where V{n) denotes the set of subsets of {1, . . . ,n} (we use the convention l\\ = 1 for 
I < 0). 

Proof. An explicit calculation using (2.1.5) shows that {i^ + B — m)bm = 0. Since 
all operator products in (2.2.19) and (2.2.23) have a factor b^ at the left, the operators 
Pm, km are solutions of the Dirac equation, 

(i^ + B — m) Pm = = (i^ + B — m) km ■ 

It remains to verify that the conditions (2.2.16, 2.2.17) are satisfied and to show unique- 
ness. 
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According to (2.2.7, 2.2.18), the advanced and retarded Green's function can be 
written as 

*m ~ + ^71" j ~ ^rn ~ ^71" ■ (2.2.24) 

We substitute the series (2.2.13, 2.2.15) into (2.2.16), 

-. oo 

fe=o 

insert (2.2.24) and expand. This gives a sum of operator products of the form 

Ci B C2 B • • • B C;+i with Cj = k„i or Cj = Sm ■ 

The contributions with an even number of factors km have the same sign for the 
advanced and retarded Green's functions and cancel in (2.2.25). The contributions 
with an odd number of km's occur in each Green's function exactly once and have the 
opposite sign. Using the notation 

we can thus rewrite (2.2.25) in the form 

00 

/=o Qer{i+i), #Qodd 

X CmiQ, 1) B CmiQ, 2)B---B CmiQ, I) B CmiQ, l+l). 

After reordering the sums, this coincides with (2.2.19). 

Next we want to give the relation (2.2.10) a mathematical meaning. To this end, 
we consider m > as a variable mass parameter. Then we can form products of the 
operators Pm,km by manipulating the arguments of the distributions in momentum 
space. For example, using (2.2.4) we obtain 

Pm{k)Pm'{k) = m + m) 5{e - m^) m + m!) 5{k'' - {m'f) 

= {k^ + (m + m')!^ + mm!) 5{rr? - {m'f) Sik^ - m^) 

= ik^ + (m + m')U + mm') ^— 5{m — m') 5{k'^ — m^) 

2m 

= 5{m - m') pra{k) , (2.2.26) 
and similarly from (2.2.5), 

Pm km' = km' Pm = S{m - m') km (2.2.27) 
km km' = 5(?w — W,') Pm ■ (2.2.28) 

We remark that this formalism has some similarity with the bra/ket notation in quan- 
tum mechanics, if the position variable x is replaced by the mass parameter m. Equa- 
tion (2.2.26) can be interpreted that the Pm are the spectral projectors of the free Dirac 
operator; the relations (2.2.27, 2.2.28) reflect the relative minus sign in km for the states 
on the upper and lower mass shell. In particular. This 
relation can be extended to the case with interaction, 

PmPm' — km km' 1 (2.2.29) 
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and gives a meaningful square of (2.2.10) (we will see in a moment that kmkm' vanishes 
for m ^ m'). If our construction ensures that Pm is a positive operator, (2.2.29) is 
even equivalent to (2.2.10). 

Let us compute the product kmkm' explicitly. The definitions (2.2.4, 2.2.5) and (2.2.18, 
2.1.8) yield in analogy to (2.2.26) the formulas^ 



Pm 


^m' 


— ^m' Pm — 


h 

i^m 


^m' 


Sm' km 


^m 


^m' 


= ppf ' 






\m — 



1 



m 



Pm (2.2.30) 



PP ( ^ ) km (2.2.31) 



{Sm - Sm') , (2.2.32) 

where PP(x~^) = ^ lim£^o[(2^ + + {x — is)~^] denotes the principal value. As a 
consequence, the operator products involving the factor Sm'Sm' are telescopic. 



n 
p=0 



km {B Smf {Sm' B^'^ km' =0 for > 1. (2.2.33) 



This allows us to evaluate the following product, 

kmbmbm' km' = 5{m - m) Pm ■ (2.2.34) 
With this formula, we can compute the square of (2.2.19), 

oo 

km km' = S{m - m') J2 i-iT^f^'^^^' b< {km hmf^' Pm {bm kmf^' b> . (2.2.35) 

We could continue the proof by verifying explicitly that the product PmPm' with pm 
according to (2.2.20) coincides with (2.2.35). This is a straightforward computation, 
but it is rather lengthy and not very instructive. We prefer to describe how the 
operator products (2.2.23) and the coefficients (2.2.22) can be derived. In order to 
make the proof more readable, we make the following simplifications. Since the factors 
bm, b^ cancel similar to (2.2.34) in telescopic sums, we can omit them in all formulas 
without changing the multiplication rules for the operator products. Then all operator 
products have km or Pm as their first and last factor, and we can multiply them with 
the rules (2.2.26-2.2.28). Since all these rules give a factor 6{m — m'), we will in any 
case get the prefactor 6(m — m!) as in (2.2.35). Therefore, we can just forget about all 
factors 5{m — m!) and consider all expressions at the same value of m. Furthermore, 
we will omit the subscript 'm' and write the intermediate factors 6 as a dot After 
these simplifications, we end up with formal products of the form 

Fi.F2.F^. ■■■ .Fn with Fj = k or Fj = p (2.2.36) 

and have the multiplication rules 

p2 = j^-i = I ^ pi. = f.p = (2.2.37) 



^Online version: As noticed by A. Grotz, in (2.2.32) the summand 7r^5(m — m') is missing. This 
error is corrected in the paper arXiv:0901.0334 [math-ph]. 
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We must find a positive operator p being a formal sum of operator products (2.2.36) 
such that 

oo 

p2 = ^ (_ •^)2/3i+2/32 _)2/3i ^ ^)2/32 _ (2.2.38) 

/3i,/32=0 

In this way, we have reduced our problem to the combinatorics of the operator products. 
As soon as we have found a solution p of (2.2.38), the expression for pm is obtained by 
adding the subscripts and by inserting the factors b^, bm, b^. Relation (2.2.29) 
follows as an immediate consequence of (2.2.38). 

The basic step for the calculation of p is to rewrite (2.2.38) in the form 

p'^ = p + A with A= {-iTrf^'^^^Hk.f>^'p{.kfl^K (2.2.39) 

(/3i,/32)^(0,0) 

The operator p is idempotent and acts as the identity on A, Ap = pA = A. Therefore, 
we can take the square root of p + A with a formal Taylor expansion, 

P = VFTA =p + f;(-l)"+i ^^^j^ A- , (2.2.40) 

n=l 

which uniquely defines p as a positive operator. 

It remains to calculate A^. If we take the nth power of the sum in (2.2.39) and 
expand, we end up with one sum over more complicated operator products. We first 
consider how these operator products look like: The operator products in (2.2.39) all 
contain an even number of factors k and exactly one factor p. The factor p can be the 
1st, 3rd,. . . factor of the product. Each combination of this type occurs in A exactly 
once. If we multiply n such terms, the resulting operator product consists of a total 
odd number of factors p, k. It may contain several factors p, which all occur at odd 
positions in the product. Furthermore, the total number of factors p is odd, as one 
sees inductively. We conclude that A" consists of a sum of operator products of the 
form 

(k . k p.k.{k.k p.k.{k.k .f-' ■■■ [k.k .)«2-+i p (. A; . A;)«2«+2 (2.2.41) 

with a, Qj > 0. We set /3 = 2q + J2j Qj- Notice that the number of factors p in (2.2.41) 
is 2a + 1; the total number of factors p,k is 2/3 + 1. The form of the operator product 
gives the only restriction < 2a < f3 for the choice of the parameters a, (3. 

Next we count how often each operator product (2.2.41) occurs in the sum: The 
easiest way to realize (2.2.41) is to form the product of the a+1 factors 

[{k.k.y^ p {.k.kf-'+^] [{k.k.Y^+^ p {.k.kY^+^] 

■■■ [(fc.A;.)'?2«+i+ip(.fc.A;)«2«+2] . (2.2.42) 

However, this is not the only way to factor (2.2.41). More precisely, to each factor 
in (2.2.42) we can apply the identities 

{k.k.ypi.k.kf = [{k.k.yp] \p{.k.kY] 

{k .k .fp{.k .kf = [ik.k.yp] [{k . k .f p {. k . kj] 

{k.k.fp{.k. kf = [{k.k .fp{. k . kj-'] \p{.k. kf] . 

By iteratively substituting these identities into (2.2.42), we can realize every factoriza- 
tion of (2.2.41). Each substitution step increases the number of factors by one. The 
steps are independent in the sense that we can fix at the beginning at which positions 
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in (2.2.42) the product shall be split up, and can then apply the steps in arbitrary 
order. There are (a + 1) + (gi — 1) + Yl'j^^ Qj + (92a+2 — 1) = /? — (a + 1) positions 
in (2.2.42) where we could split up the product (in the case = or q2a+2 = 0, the 
counting of the positions is slightly different, but yields the same result). Since we 
want to have n factors at the end, we must choose n — (a + 1) of these positions, which 
is only possible for a + 1 < n < /? and then gives (/? — a — l)!/((n — a — 1)\ {(3 — n)!) 
possibilities. 

Combining these combinatorial factors with the constraints < 2a < /? and a+1 < 
n < /3, we obtain for n > 1 the identity 

oo mm(n-l,[|]) 
I3=n a=0 ^ Qer{/3+l), #Q=2a+l 

X {-iTrf^ F{Q, l).k. F{Q, 2) . k . ■ ■ ■ . k . F{Q, /3 + 1) (2.2.43) 

with F(Q,n) = p for n ^ Q and F{Q,n) = k otherwise. Notice that the last sum in 
(2.2.43) runs over all possible configurations of the factors p, k in the operator product 
(2.2.41) for fixed a, (3. We finally substitute this formula into (2.2.40) and pull the 
sums over a, P outside. This gives the desired formula for p. ■ 



We call the perturbation expansion of the above theorem the causal perturbation 
expansion. It allows us to define the Dirac sea in the presence of an external field 
canonically by 

P'''''{x,y) = ^ {Pm-km)ix,y) . 

In the next section the causal perturbation expansion will be extended to systems of 
Dirac seas, and in §2.4 we will discuss it in detail. 

We conclude this section by showing that, under suitable regularity and decay 
assumptions on the external potentials, all operator products which appeared in this 
section are well-defined and finite. 

Lemma 2.2.2. Let (Cj), < j < n, be a choice of operators Cj G {km,Prm Sm} ■ 
If the external potential B is smooth and decays so fast at infinity that the functions 
B{x), x'^B{x), and x^x^B{x) are integrable, then the operator product 

{CnBCn-iB---BCo){x,y) (2.2.44) 

is a well-defined tempered distribution on X M^. 

Proof. Calculating the Fourier transform of (2.2.44) gives the formal expression 

M{q2,qi) := J ' ' ' J ^^^Cn{q2) B{qi - Pn-i) 

X C„_i(p„_i) Bipn-i - Pn-2) ■ ■ ■ Ciipi) B{pi - qi) Co(gi) , (2.2.45) 

where we consider the Cj as multiplication operators in momentum space and where 
B denotes the Fourier transform of the function B (it is more convenient to work 
in momentum space because the operators Cj arc then diagonal). We will show that 
M{q2, qi) is a well-defined tempered distribution; the Lemma then immediately follows 
by transforming back to position space. 
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The assumptions on B yield that B is and has rapid decay at infinity, i.e. 

sup I ••• g*" (9^^(5)1 < oo 

qeIR'', \k\<2 

for all n, all tensor indices ii, . . . ,in and multi-indices k, (with k = (k^, . . . , k^), \k\ := 
q). As is verified explicitly in momentum space, the distributions km, Pm or Sm are 
bounded in the Schwartz norms of the test functions involving derivatives of only first 
order, more precisely 

|C(/)| < const ||/||4,i with C = km, Pm or and f eS, 

where the Schwartz norms are as usual defined by 

= , ™ax sup \x^djf{x)\. 

\I\<P, \J\<<] ^glR* 

As a consequence, we can apply the corresponding operators even to functions with 
rapid decay which are only C^. Furthermore, we can form the convolution of such 
functions with C; this gives continuous functions (which will no longer have rapid 
decay, however). Since C involves first derivatives, a convolution decreases the order 
of differentiability of the function by one. 

We consider the combination of multiplication and convolution 

F{P2) ■■= 1^ f(P^ - P^) C^P^) 9{Pi) , (2.2.46) 

where we assume that f G C'^ has rapid decay and g E is bounded together with its 
first derivatives, HffHo,! < oo. For any fixed p2, the integral in (2.2.46) is well-defined 
and finite because f(j)2 — ■) g{-) is and has rapid decay. The resulting function F 
is and bounded together with its first derivatives, more precisely 

||i^||o,i < const ||/||4,2 llffllci . (2.2.47) 

After these preparations, we can estimate the integrals in (2.2.45) from the right 
to the left: We choose two test functions f,gE <S(M^,C^) and introduce the functions 

PiiPi) = / ^ ^(P^ - ^o{qi) g{qi) (2.2.48) 

FjiPj) = I ^^B{pj-pj^,)Cj^,{pj^,)Fj_,{pj_,) , l<j<n. (2.2.49) 

The integral (2.2.48) is of the form (2.2.46) and satisfies the above considered as- 
sumptions on the integrand. Using the bound (2.2.47), we can proceed inductively in 
(2.2.49). Finally, we perform the g2-integration, 

M{f,g) = I ^f{q2)Cn{q2)FM. 

We conclude that M is a linear functional on 5(M^, C^) x 5(M^, C^), which is bounded 
in the Schwartz norm ||.||4,i of the test functions. ■ 

Using the language of quantum field theory, we also refer to the summands of the 
perturbation expansions as Feynman diagrams. Then the result of the last lemma 
can be understood from the fact that in an external field one only encounters tree 
diagrams, which are all finite. Clearly, the existence of the perturbation expansion 
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to every order does not imply the convergence of the perturbation series, and we will 
come back to this problem in §2.5. 



2.3. Definition of the Fermionic Projector 

In this section wc introduce the mathematical framework for describing a many- 
fermion system in the presence of an external field. To this end, we first extend the 
construction of §2.2 to a system of Dirac seas of in general different masses, which 
may involve chiral masslcss Dirac seas. Then we introduce particles and anti-particles 
by occupying additional states and creating "holes" in the Dirac seas, respectively. 
Our construction is intended to be so general that it allows us to model the fermion 
configuration of the standard model (see §5.1). For clarity, we postpone the question 
of how the fermionic states are to be normalized to §2.6. 

First, we need to introduce a distribution P^^^(x,y) which describes the system in 
the vacuum. The most general ansatz is to take a direct sum of sums of Dirac seas, 

N g{a) 

psea ^ 0^p-a^ (2.3.1) 

a=l a=l 

where g(a) are positive integers and the summands are Dirac seas of a form 

similar to (2.2.2). The direct sum increases the total number of components of the wave 
functions, the so-called spin dimension, to AN. The direct summands are called sectors, 
and we refer to the indices a and a as the sector and generation index, respectively. 
For each Dirac sea we introduce a mass parameter rrina > 0. In order to allow for 
chiral massless Dirac seas, we introduce (4 x 4)-matrices Xaa with 



1 if maa > 

, XL or xr if ruaa = 



and set 

— l^^aaiVmaa ^niaa) • (2.3.2) 

We refer to P^^^ as defined by (2.3.1, 2.3.2) as the fermionic projector of the vacuum. 
It is sometimes useful to consider P^^ as a matrix in the sectors indices, 

J 9{a.) 

(psea^a _ ^ -^^^ {prriaa ~ ^ruaa) 

^ a=l 

with a,h=\,. . . ,N . 

Since each sector may involve several Dirac seas of different masses, it seems im- 
possible to write the fermionic projector of the vacuum as a solution of a suitable Dirac 
equation, and thus we have no starting point for a perturbation expansion. In order 
to bypass this problem, we replace the sum in (2.3.1) by a direct sum and introduce 
the so-called auxiliary fermionic projector by 

N 9{a) 

psea ^ 00P- (2.3.3) 

a=l a=l 

Using the same notation as for the fermionic projector is usually no problem because 
it will be clear from the context whether the fermionic projector or the auxiliary 
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fermionic projector is meant. In case of potential confusion we write the auxiliary 
fermionic projector as a matrix in the sector and generation indices, 

with a,b = 1, . . . ,N, a = 1, . . . , g{a), j3 = 1,. . . ,g{b)). In this notation, one also sees 
that the fermionic projector can be obtained from the auxiliary fermionic projector by 
taking the so-called partial trace over the generations, 

9{a) g{b) 
a=l 13=1 

We introduce the operators 

N g{a) N g{a) 

P ^ ® ® ^"^«« ' ^ ^ ® ® 
a=l a=l a=l a=l 

and define the matrices 

N g{a) ^ N g{a) 

^ ^ ® ® ' ^ ^ 



I m, 



m 

a=l a=l a=l a=l 



act 7 



which are called chiral asymmetry matrix and mass matrix, respectively. Here m is an 
arbitrary mass parameter; a convenient choice is m = maxa^an^aa- These operators 
act on direct sums of Dirac wave functions, i.e. on functions of the form ^' = (^'(""^(x)) 
with ^i^"") a 4-component Dirac spinor. On these wave funtions, we introduce the spin 
scalar product by 

n g{a) 

^* I = 5m I ^^'"'^^Dirac , (2-3.5) 

a=l a=l 

where -<.|.;^Dirac is the usual spin scalar product on Dirac spinors (1.2.8). In general- 
ization of (1.5.23) we also introduce the indefinite inner product 



<^ I $> = / I $^ d/i . (2.3.6) 
Jm 

Then the operators p and k are Hermitian with respect to <.| •>, and the mass matrix Y 
is Hermitian with respect to the spin scalar product. Using the above notation, we 
can write the auxiliary fermionic projector as 

P^<=-(x, y) = ip{x, y) - k{x, y)) . (2.3.7) 

Since = for Xi 1 and since the operators Pm=Oi km=o are odd, we have 
alternatively 

P'^^ix, y) = \ (pix, y) - Hx, y)) x* , (2.3.8) 

where X* is the adjoint with respect to the spin scalar product. The auxiliary fermionic 
projector is a solution of the free Dirac equation 

{i^^-mY) P^^^{x,y) = 0. (2.3.9) 

Our strategy is to extend the definition of the auxiliary fermionic projector to the 
interacting system and then to get back to the fermionic projector by taking the 
partial trace (2.3.4). 
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In order to describe the system of Dirac seas in the presence of an external field, 

(aa) ■ 

m 

{i^^ + B-mY)P^''''{x,y) = 0. (2.3.10) 



we insert a differential operator B = (S/^m ) into the Dirac equation (2.3.9), 



We always assume that B is Hermitian with respect to the inner product <.|.>. The 
causal perturbation expansion for the operators k and p can be carried out exactly as 
in §2.2: We define the advanced and retarded Green's functions by 

N g{a) N g{a) 

Their perturbation expansion is, in analogy to (2.2.13, 2.2.15), uniquely given by 

oo oo 

= ^(-s^^)^s^, = ^{-s'-Bfs^. (2.3.11) 

fe=0 k=i) 

The method of Theorem 2.2.1 now yields the following result. 

Theorem 2.3.1. The perturbation expansion forp and k is uniquely determined 
by the conditions 

1 ~A\ ~ formally 



A; = _^ (gV _ ^~A) ^ ~ _ (2.3.12) 

We have the explicit formulas 

oo oo [ 2 J 

k = ^{-iiTfH<k{hkfH> , p = ^^c(q,/3) G(a,/3) 

/3=0 /3=0 a=0 



with 



c(0,0) = 1, 

n=a+l ^ ^ 



and 



G{f,9) = Yl (-^^)'^ 

QeV{l3+l), #Q=2a+l 

X b< FiQ, 1) bkb FiQ, 2) bkb ■■■ bkb F{Q, /3 + 1) 6> , 
where V{n) is the set of subsets of {1, ... , n} and where we use the notation 

oo oo oo 

b< = J2(-sB)\ b = b> = J](-Bs)^ 

A;=n fc=0 fc=0 

The contributions to this perturbation expansion are all well-defined according to 
Lemma 2.2.2. 
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After this straightforward generalization, we come to the more subtle question of 
how to define P^'^'^ when a chiral asymmetry is present. The obvious idea is to set in 
generalization of (2.3.7) 

P-=^(x,j/) = X^{p-k){x,y). (2.3.13) 

This is not convincing, however, because we could just as well have defined P^^^{x,y) 
in analogy to (2.3.8) by P*^*^*^ = ^{p - k) X* , which does not coincide with (2.3.13) 
as soon as X, X* do not commute with B. Actually, this arbitrariness in defining the 
Dirac sea reflects a basic problem of the causal perturbation expansion for systems 
with chiral asymmetry. In order to describe the problem in more detail, we consider 
the perturbation calculation for k to first order. According to (2.3.11, 2.3.12), 

k = k - ^ {s'' B s"" - s"" B s"") + 0(^2) (2.3.14) 

= k - sBk - kBs + 0{B'^) . 

This expansion is causal in the sense that k{x, y) only depends on B in the "diamond" 
(L^ n Ly) U (L^ n L^), as is obvious from (2.3.14). It is not clear, however, how to 
insert the chiral asymmetry matrix into this formula. It seems most natural to replace 
all factors k by Xk, 

(Xk) = Xk - sBXk - XkBs + 0{B^). (2.3.15) 

Unfortunately, this expression cannot be written similar to (2.3.14) with the advanced 
and retarded Green's functions, which means that the causality of the expansion is 
in general lost. In order to avoid this problem, one might want to insert X at every 
factor s,k, 

(Xk) = Xk - XsBXk - XkBXs + 0{B^) 

= Xk - ^ {Xs^^ BXs"^ - Xs^ BXs^) + 0{B^) . (2.3.16) 

ZTTl 

This expansion is causal similar to (2.3.14). In general, however, it does not satisfy 
the Dirac equation (i^ + B — m) (Xk) = 0, which does not seem to be what we want. 

The only way to resolve this problem is to impose that the perturbation expan- 
sions (2.3.15) and (2.3.16) should coincide. This yields a condition for the operator B, 
which can be characterized as follows. We demand that 

Xs'^BXs'^ = s'^BXs'^ = Xs'^Bs'^ . (2.3.17) 

Since the operator s^=o o'^'^' have Xs"^ = s'^X*. Substituting into the second 
equation of (2.3.17) yields the condition X* B = B X. Since X is idempotent, this 
condition automatically implies the first equation of (2.3.17). We formulate the derived 
condition for the whole Dirac operator i^ + B — niY and thus combine it with the fact 
that chiral fermions are massless (i.e. X*Y = YX = Y) and that X is composed of 
chiral projectors (which implies that X*^ = ^X). 

Def. 2.3.2. The Dirac operator is called causality compatible with X if 



X* {i^ + B- mY) = {i^ + B- mY) X 



(2.3.18) 
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In the perturbation expansion to higher order, the condition (2.3.18) allows us 
to commute X through all operator products. Using idempotence = X, we can 
moreover add factors X to the product; in particular, 

X CiBCiB ■ ■ ■ BCn = XCiBXCiB ■ ■ ■ BXCn with Cj = p, Cj = koiCj = s. 

This ensures that the perturbation expansion is also well-defined to higher order. For a 
Dirac operator which is causality compatible with X, the auxiliary fermionic projector 
is defined canonically by (2.3.13). 

So far the auxiliary fermionic projector describes a system of Dirac seas in the 
presence of an external field. In order to insert particles and anti-particles into the 
system, we add the projectors on particle states and substract the projectors on anti- 
particle states, 

P{x,y) = P'''^{x,y) 

"/ ria 

Cnorm J2\Mx>^My)\, (2-3.19) 

k=l 1=1 

where and are an orthogonal set of solutions of the Dirac equation, and the 
must lie in the image of P^^ (for the normalization constant c„orm see §2.6). The 
parameters Uf and Ha denote the total number of particles and anti-particles, respec- 
tively. We usually avoid the issue of convergence of the sums in (2.3.19) by assuming 
that nf,na < oo, but one could clearly also consider an infinite number of particles 
and/or anti-particles. Finally, the fermionic projector is obtained from this expression 
by taking similar to (2.3.4) the partial trace, 

g{a) gib) 

a=l 13=1 

Theorem 2.3.1 together with (2.3.13, 2.3.19, 2.3.20) yields a mathematical frame- 
work for describing a general many-fermion system in the presence of an external 
field. Our construction makes Dirac's concept of a "sea of interacting particles" math- 
ematically precise. Apart from the causality compatibility condition (2.3.18) and the 
regularity conditions in Lemma 2.2.2, the operator B is completely arbitrary. We point 
out that we do not use the fermionic Fock space formalism of canonical quantum field 
theory; the connection to this formalism will be explained in §3.2 and Appendix A. 

2.4. Interpretation and Consequences 

With the definition of the fermionic projector we radically departed from the usual 
concept of the Dirac sea as "all negative-energy solutions" of the Dirac equation. 
Instead, we used causality in a particular way. In order to clarify the connection 
between our definition and the usual concept of the Dirac sea, we now describe how 
the above constructions simplify in the special situation that B is static. If considered 
as multiplication operators, static potentials map functions of positive and negative 
frequency into functions of positive and negative frequency, respectively. Since the 
operators p, k and s are diagonal in momentum space, they clearly preserve the sign 
of the frequency too. Thus 

[n±,p] = [n±,fc] = [n±,s] = [u^,b] = o, (2.4.1) 



46 



2. THE FERMIONIC PROJECTOR IN THE CONTINUUM 



where the operators 11='= are the projectors onto the states of positive and negative 
frequency, respectively (i.e. in momentum space, 11='= are the operators of multiphcation 
by the functions Q{±k^)). The operators p and k differ only by a relative minus sign 
for the states of positive and negative frequency, 

Using this relation together with (2.4.1), we can replace pairs of factors p by pairs of 
factors k. For example, 

...pB---pB--- = ■■■pB---pB--- {U+ + U-) 

= U+{---kB---kB---) + U-{---{-k)B---{-k)B---) 

= ---kB-'-kB--- , (2.4.2) 

where the dots '• • • ' denote any combination of the operators s, k, p and B. This allows 
us to simplify the formula for p by using only one factor p in every operator product. 
After going through the details of the combinatorics, one obtains the formula 



p = Y^{-mfH<p{hkfH'' 



b=0 

Thus the fermionic projector (2.3.13) can be written as 



lx{p-k) 



{hkf^ b> 



b=0 

This equation shows that P^'^^ is composed of all negative-frequency eigenstates of 
the Dirac operator (notice that the expression in the brackets [• • • ] is the fermionic 
projector of the vacuum and that all other factors preserve the sign of the frequency). 
We conclude that for static potentials our definition reduces to the usual concept of 
the Dirac sea as "all negative-energy states." 

In order to get a better understanding of the time-dependent situation, we next 
consider a scattering process. For simplicity, we consider a system of one Dirac sea and 
assume that the scattering takes place in finite time io < ^ < ^i- This means that the 
wave functions satisfy the Dirac equation (2.1.1) with B supported in a finite time 
interval, 

B{t,x) = if t ^ [to,ti] . (2.4.3) 

As a consequence, '^{t,x) is for t < to a solution of the free Dirac equation. We 
uniquely extend this free solution to the whole Minkowski space and denote it by ^'in, 

(z^-m)*in = and *in(i,^) = *(i,^) for t < to- 
Similar ly, '^{t, x) is also for t > t\ s, solution of the free Dirac equation; we denote its 
extension by ^'outj 

(z^-m)^'out = and ^ovx{t,x) = '^{t,x) for t > ti. 

The wave functions ^'in and ^out arc called the incoming and outgoing scattering 
states, respectively. Recall that the dynamics of the wave functions is described in- 
finitesimally by the Dirac equation in the Hamiltonian form (1.2.18), where h is a 
symmetric operator on the Hilbert space (Ti, (., .)) with scalar product (1.2.17). Inte- 
grating this equation from tQ to ti, we obtain a unitary operator S which maps the 
incoming scattering states to the corresponding outgoing states, 

*out = S^,n. (2.4.4) 
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The operator S is called scattering operator or S -matrix. 

Using the scattering states, we can introduce fermionic projectors which describe 
the vacua in the asymptotic past and future: For an observer in the past t < to, the 
external potential is zero. Thus it is natural for him to describe the vacuum with the 
free Dirac sea (2.2.1). If this Dirac sea is extended to the whole Minkowski space with 
external potential, one gets the object 

P^(x,y) = --^ / (2.4.5) 

where the wave functions '^^g^ are the solutions of the Dirac equation (2.1.1) whose 
incoming scattering states are the plane wave solutions '^^se: 

{i^ + B- m) = and (*^, = ■ 

Using the support conditions (2.4.3, 2.1.9), we can express the state ^^g^ in a pertur- 
bation series, 

oo 
n=0 

Substituting this formula into (2.4.5) we obtain for a perturbation expansion which 
involves only the retarded Green's functions, 

oo 

P^ = {-s^ BY^ P""^ {-B s^T\ (2.4.6) 

ni,n2=0 

where P^^ stands for the free Dirac sea (2.2.1). Accordingly, an observer in the future 
t > to describes the vacuum by the fermionic projector 

P\x,y) = -"^Y^L \'^Ps-^<'^l-\ dl^p, (2.4.7) 

where 

{i^ + B-m)^'ig, = and (*Uout = ^pse ■ 
Its perturbation expansion involves only the advanced Green's function, 

oo 

= ^ (-s^ H)"i (-B s^)"2 . (2.4.8) 

ni.n2=0 

Using (2.4.4) in (2.4.5, 2.4.7), we can describe the fermionic projectors in the asymp- 
totic past and future with the S-matrix by 

^ pvac ^ pv^^ ^ pA^^ ^ SP^'^'^S-^ , py = S-^P^'^'^S . (2.4.9) 

What makes the scattering process interesting is the fact that the vacua in the 
asymptotic past and future in general do not coincide. Consider for example the 
physical system described by the fermionic projector P := P^. For the observer in the 
past, the system is in the vacuum. However, if P^ 7^ P^, the system will not be in the 
vacuum for the observer in the future. This means that for him, positive frequency 
states are occupied and negative frequency states are unoccupied and thus the system 
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contains particles and anti-particles. More precisely, if we write the fermionic projector 
in analogy to (2.3.19) as 

P{x,y) = P\x,y) 

"l~ Cnorm Y,\'^k{x)^<^k{y)\ - c„orm^|$Ka^)^^^Ky)l , (2-4.10) 

k=l 1=1 

then the and are the wave functions of the particles and anti-particles, respec- 
tively. These particles and anti-particles are physical reality; the observer in the future 
can detect them by making suitable experiments. This is the physical effect of pair 
creation. Using (2.4.9) one can express the pair creation completely in terms of the 
S'-matrix. Other scattering processes are described similarly. 

We point out that describing the scattering process with the two observers in the 
past and future is merely a matter of convenience. The physical process can be de- 
scribed equivalently (although maybe less conveniently) in the reference frame of any 
other observer. To give a concrete example, we consider an observer in the future who 
is in a reference frame moving with constant acceleration. This leads to the so-called 
Unruh effect , which we now briefly outline (for details see e.g. [Wa2]). For the accel- 
erated observer, space-time is stationary (i.e. his time direction is a Killing field, but 
it is not a unit normal to the hypersurfaces t = const) , and this allows him to use the 
separation ansatz (2.1.2) with t his proper time. The sign of uj gives him a splitting 
of the solution space into solutions of positive and negative energy. Using Dirac's hole 
interpretation corresponding to this splitting, he finds for the many-fermion system 
described by P an infinite number of particles and anti-particles in a thermal equi- 
librium. This bizarre effect shows that the interpretation of the physical system in 
terms of particles and anti-particles does depend on the observer. Nevertheless, the 
Unruh effect does not contradict the pair creation experiments made by the future 
observer at rest. Namely, if the accelerated observer wants to explain the experiments 
by the future observer at rest, he must take into account that he himself is feeling a 
gravitational field, and that for him the experimental apparatus used by the observer 
at rest is in accelerated motion. It turns out that these additional effects just com- 
pensate the Unruh effect, so that the predictions by the accelerated observer are in 
complete agreement with the observations of the particles and anti-particles in (2.4.10) 
by the future observer at rest. More generally, all quantities which can be measured 
in experiments can be expressed in terms of the S-matrix. Since the S-matrix does not 
depend on the particle/anti-particlc interpretation, it is clear that all experiments can 
be explained equivalently in any reference frame. 

We just saw that the particle/anti-particle interpretation of a fermionic system may 
depend on the observer. Actually, the situation is even worse for an observer in the 
time period to < t < ti when the interaction takes place. For him, the system is neither 
static nor stationary. Therefore, he has no notion of "negative-energy state" , and thus 
for him the particle/anti-particle interpretation completely breaks down. Taking into 
account that a scattering process is an idealized process and that in a real physical 
situation there will be no region of space-time where no interaction takes place, we 
come to the disillusioning conclusion that for a local observer under generic conditions, 
a many-fermion system has no interpretation in terms of particles and anti-particles. 

The causal perturbation expansion yields a canonical object P^*^^ which describes 
the Dirac sea in the scattering process, even in the region with interaction to < ^ < ^i- 
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Its construction is explicitly covariant and independent of a local observer. Decompos- 
ing the fermionic projector in the form (2.3.19), we obtain a canonical interpretation 
of the many-fermion system in terms of particles and anti-particles. One should keep 
in mind that P^^'^ does not correspond to the vacuum of any local observer, but is a 
global object of space-time. As a consequence, also the particle/anti-particle interpre- 
tation in (2.3.19) can be associated only to an abstract "global observer" in space-time. 
More specifically, comparing Theorem 2.3.1 and (2.3.13) with (2.4.6, 2.4.8), one sees 
that P^^^ coincides neither with nor with P"^ . Since its perturbation expansion 
involves both retarded and advanced Green's functions, it can be considered as being 
some kind of "interpolation" between P^ and P^. 

Let us now discuss our assumption on the potential B as being external. As ex- 
plained at the end of §2.1, this is no restriction in principle because one can first solve 
the physical equations of the coupled system and then can define P^^^ for the external 
potential B as given by the solution of the coupled system. Clearly, this procedure 
cannot be carried out in practice, but this is of no relevance for the theoretical consid- 
erations here. The important point is that P^^^ is not defined locally; for its definition 
we need to know B in the whole space-time. This is puzzling because the conventional 
physical equations are local and causal, and this is the first time that an object appears 
which is defined in a non-local way. One might conclude that P^^ is an object which is 
not compatible with causality and should therefore have no physical significance. Our 
concept is the opposite: We regard the appearance of a non-local object as a first hint 
that locality and causality should be given up in the strict sense. In order to formulate 
physical equations which could replace the conventional local and causal equations, we 
shall consider the fermionic projector as the fundamental object in space-time. 

Before we can make these ideas precise in Chapter 3, we need to analyze the 
fermionic projector in the continuum in more detail. One task is to understand what 
"causality" of the causal perturbation expansion means precisely. At the moment, 
we know that causality was used for the definition of P^^^^ but that nevertheless the 
fermionic projector is a nonlocal and non-causal object. We need to find out how these 
seemingly contradicting facts fit together. Also, we must understand better how P^^^ 
depends on B. More specifically, we need to analyze what information on the external 
potentials is encoded in P^'^^^ and how this information can be extracted. Finally, 
we must specify how the fermionic states in (2.3.19) are to be normalized. The next 
sections provide the mathematical tools for answering these questions. 

2.5. The Light-Cone Expansion 

The light-cone expansion is a very useful technique for analyzing the fermionic 
projector near the light cone. In order to give a brief but self-contained introduction, 
we will explain the methods and results of [F6] leaving out many proofs and technical 
details. Our setting is that of §2.3 with several sectors and generations (2.3.1). It 
suffices to consider the auxiliary fermionic projector because the fermionic projector 
is obtained from it simply by taking the partial trace (2.3.20). We again assume that 
the Dirac operator in (2.3.10) is causality compatible (2.3.18) and that the operator B 
is Hermitian with respect to the inner product (2.3.6). Furthermore, we assume as 
in [F6] that ;B is a multiplication operator composed of chiral and scalar /pseudoscalar 
potentials, 

B{x) = Xl4r{x) + XrM^) + Hx) + ip^{x). (2.5.1) 
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We note for clarity that these potentials may act non-trivially on the sectors and 
generations (e.g. writing the right-handed potential as a matrix, Ar = {Afi)^^'^^ the 
matrix elements are independent vector fields with the only constraint that the matrix 
must be Hermitian). In particular, the potentials in (2.5.1) do in general not commute, 
and thus we must in products be careful with the order of multiplication. For an 
operator B which involves bilinear and gravitational potentials see [F5]. 

Dep. 2.5.1. A distribution A{x, y) on x is of the order 0{{y - xfP), p G Z, 
if the product 

{y-x)-^P A{x,y) 

is a regular distribution (=a locally integrable function). It has the light-cone ex- 
pansion 

oo 

A{x,y) = ^A[^l(x,y) (2.5.2) 

3=9 

with g ^ if the distributions A^\x,y) are of the order 0{{y — x)'^^) and if A is 
approximated by the partial sums in the sense that for all p> g, 

p 

A{x, y) - ^((y - xf^'^^) ■ (2-5.3) 

The light-cone expansion describes the behavior of a distribution near the light 
cone. More precisely, the expansion parameter (y — x)^ vanishes if y lies on the 
light cone centered at x, and thus the distributions A^\x,y) approximate A{x,y) 
for y in a neighborhood of L^. The first summand A^^\x,y) gives the leading order 
of A{x, y) on the light cone, ^[^"^-'^1 gives the next order on the light cone, etc. If 
the distribution A is singular on the light cone, the parameter g will be negative. 
Note that the distributions ^1^1 are determined only up to contributions of higher 
order 0{{y — .x)^-'"'"^), but this ambiguity will not lead to any problems in what follows. 
We point out that we do not demand that the infinite series in (2.5.2) converges. 
This series is defined only via the approximation by the partial sums (2.5.3). Despite 
this formal character of the series, the light-cone expansion completely describes the 
behavior of A{x^ y) near the light cone. This situation can be seen in analogy to 
the Taylor expansion of a smooth, nonanalytic function. Although the Taylor series 
does in general not converge, the Taylor polynomials give local approximations of 
the function. An important difference to a Taylor expansion is that the A^^^{x,y) 
approximate A{x,y) even for points x and y which are far apart. We only need that y 
is close to the light cone Lx, which is an unbounded hypersurface in R"^. In this sense, 
the light-cone expansion is a non-local expansion. 

For clarity, we begin with the light-cone expansion for the causal Green's functions, 
and we will later extend the results to the fermionic projector. In order to get a first 
idea of how the light-cone expansion can be carried out, we consider the free advanced 
Green's function s^^^ as defined by (2.1.7, 2.1.8). We can pull the Dirac matrices out 
of the Fourier integral by setting 

.sl{x,y) = {z<^^ + m)Sl,{x,y), (2.5.4) 

where S^2 is the advanced Green's function of the Klein-Gordon operator. 
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This Fourier integral can be computed explicitly; we expand the resulting Bessel func- 
tion in a power series, 

- 4 - S g JUTW. ^ • 

where we used the abbreviation ^ = y — x. This calculation shows that {x, y) has 
a 5{{y — a:;)^)-like singularity on the light cone. Furthermore, one sees that is a 
power series in m? . The important point for us is that the contributions of higher 
order in m? contain more factors {y — x)^ and are thus of higher order on the light 
cone. More precisely, 

(d^) ^^rn^m^=o{^^y) is of the Order 0{{y - xf^-^) . (2.5.7) 

According to (2.5.4), the Dirac Green's function is obtained by taking the first partial 
derivatives of (2.5.6). Thus s^{x,y) has a singularity on the light cone which is even 
~ (5'((y — x)^). The higher order contributions in m are again of increasing order on 
the light cone. This means that we can view the Taylor expansion of (2.5.4) in m, 

oo 2n / /7 \ " 

= E ^ + I m2=o(^, y) > (2-5.8) 

n=0 ■ ^ 

as a light-cone expansion of the Green's function. 

Writing the light-cone expansion of s^^ in the form (2.5.8) clearly is more convenient 
than working with the explicit formula (2.5.6). This is our motivation for using an 
expansion with respect to the mass parameter also in the presence of the external field. 
Expanding the perturbation expansion (2.3.11) in m gives a double series in powers 
of m and B. In order to combine these two expansions in a single perturbation series, 
we write the mass matrix and the scalar /pseudoscalar potentials together by setting 

Yl{x) = y - 1 ($(x) + is(x)) , Yr{x) = y - 1 ($(x) - is(x)) . (2.5.9) 

The matrices Yj^iJi{x) are called dynamical mass matrices; notice that = Yr. With 
this notation, we can rewrite the Dirac operator as 

i^ + B-mY = i^ + B with (2.5.10) 

B = ^L{4^R-mYR) + XRi^L-^^^L) ■ (2.5.11) 

For the light-cone expansion of the Green's functions, we shall always consider B as 
the perturbation of the Dirac operator. This has the advantage that the free theory 
consists of zero-mass fermions, and thus the Green's functions of the free Dirac operator 
have the simple form 

s^{x,y) = i^,Sl,^^{x,y), s\x,y) = i^, Sl,^^{x,y) . (2.5.12) 

The Green's functions with interaction are given in analogy to (2.3.11) by the pertur- 
bation series 



5^ 



Y.i-s' Bfs^ , 5^ = ^(-s^ 5)^=5^. (2.5.13) 



k=0 k=0 
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We remark that this perturbation expansion around zero mass is most convenient, 
but not essential for the Ught-cone expansion; see [F5] for a Ught cone expansion of a 

massive Dirac sea. 

Our first goal is to perform the light-cone expansion of each Feynman diagram 
in (2.5.13). Using an inductive construction based on Lemma 2.5.2 below, this will 
give us the result of Theorem 2.5.3. Since the construction is exactly the same for 
the advanced and retarded Green's functions, wc will omit all superscripts and 
The formulas for the advanced and retarded Green's functions are obtained by adding 
either superscripts or to all operators s and S. For the mass expansion of the 
operator , we set a = and introduce the notation 

S^'^ = (^)'^a|a=0 (^>0). (2.5.14) 

Let us derive some computation rules for the S^'^ . Sa satisfies the defining equation of 
a Klein-Gordon Green's function 

(-□x - a) Sa{x, y) = (5^(x - y) . 

Differentiating with respect to a yields 

-n.S^'\x,y) = \o6\x-y) + lS^'-'\x,y) (/ > 0). (2.5.15) 

For I = 0, this formula does not seem to make sense because 5*^"^^ is undefined. 
However, the expression is meaningful if one keeps in mind that in this case the factor 
/ is zero, and thus the whole second summand vanishes. We will also use this convention 
in the following calculations. Next, we differentiate the formula for Sa in momentum 
space, 

with respect to both p and a. Comparing the results gives the relation 

^,Saip) = -2p, ISaip) , 
or, after expanding in the parameter a, 

^S^'\p) = -2pkS^'+'\p) (Z>0). (2.5.17) 
This formula also determines the derivatives of S'(') in position space. Namely, 

(2.5.17) ^_ f . -ip(x-y) 

~ 2 J (27r)4 dp^ 

_i_ r d^p o(l-l)^ . _d_ ip(cc-y) 

2j (27r)4'' ^> dp^ 
= ^{y-x)kS^'-'\x,y) il>l). (2.5.18) 
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We iterate this relation to compute the Laplacian, 

= -i^((y-a;)^s('-i)(,:,y)) 

= 2S^'-^\x,y) -\{y~xfS^'-''\x,y) {l>2). 
After comparing with (2.5.15), we conclude that 

{y-xf S^^\x,y) = -AlS'^^+^\x,y) {l>0). (2.5.19) 

Furthermore, S^^\x,y) is only a function of {y — x), and thus 

^:5«(x,y) = -l^S^'\x,y) {l>0). (2.5.20) 



dx^ ' dy^ 

Finally, it is convenient to use the identity (2.5.18) also in the case I = and to use it 
as the definition oi S~^, 

^S^'\x,y) = ^{y-x)kS^'-'\x,y) (I > 0). (2.5.21) 

Notice that S^"^^ itself remains undefined, only the combination {y — x)kS^~^^ is given 
a mathematical meaning as the partial derivative of the distribution 2S^^^ . 

The next lemma gives the light-cone expansion of an operator product where a 
potential V is sandwiched between two mass-derivatives of the Green's function. This 
expansion is the key for the subsequent iterative light-cone expansion of all Feynman 
diagrams. Wc always assume without saying that the potentials satisfy the regularity 
conditions of Lemma 2.2.2. 

Lemma 2.5.2. The operator product S^^^ V S^^^ with l,r > has the light-cone 
expansion 



oo „i 

(5«F5(''))(x,y) = «'(l-«r («-a'r 

X (□"y)|,,+(i_,), da S(-+'+'-+i)(x,y) . (2.5.22) 



The fact that line integrals appear in this lemma can be understood in analogy to 
the method of integration along characteristics (see e.g. [Ta, Fl]) for a solution of an 
inhomogeneous wave equation (for a more detailed discussion of this point see [F5]). 
The advantage of the above lemma is that it gives a whole series of line integrals. A 
further difference is that the left side is an operator product, making it unnecessary 
to specify initial or boundary values. 

Proof of Lemma 2.5.2. Our method is to first compute the Laplacian of both 
sides of (2.5.22). Comparing the structure of the resulting formulas, it will be possible 
to proceed by induction in I. 

On the left side of (2.5.22), we calculate the Laplacian with the help of (2.5.15) to 



-□,(5«y5«)(a;,y) = 5i^^V{x) S^'\x,y) + I {S^'-^^ V S^'y){x,y) . (2.5.23) 
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The Laplacian of the integral on the right side of (2.5.22) can be computed with 
(2.5.21) and (2.5.15), 

a' (1 - af (a - a^f (□"1^)|ay+(i-a). da 5("+'+'^+i)(x, y) (2.5.24) 

^0 

= - ay+^ [a - o?T (□"+V)|,,+(i_,), da S^^^'^^^'-\x,y) 

Jo 

-I a' (1 - q)"+i {a - a^T 
Jo 

X (5fen"l^)|«,+(i_„), da {y - x)"^ S(^+'+'\x,y) 
+{n + l + r + l) 

X a' (1 - ay {a - a^nn^V)\^y+^,_a)x da S^^+'+'\x,y) . 
Jo 

In the second summand, we rewrite the partial derivative as a derivative with respect 
to a and integrate by parts, 

a' (1 - aY+' {a - a^T (5fcn"F)|,j,+(i_,), da {y - xf 

Jo 

= a' (1 - aY+^ {a - a^f ^(□"^)|a,+(i-a). da 

= -5n,Q V{x) 

-{n + I) f (1 - aY+^ {a - «2)-i (□"l^)|„^+(i_,), da 
Jo 

+{n + r + l) f a' (1 - aY (a - a^)" (□"l^)|ay+(i-,,), da 
Jo 

= -Sn,0 Sift y{x) 

-n C a' (1 - a)^+2 (a - a^Y~^ (□'^F)|,j,+(i_„), da 
Jo 

+{n + l + r + l) I a^{l- aY (a - a'^Y (□"V^)|a2,+(i-a)x da 
Jo 

-I [ a'-' (1 - aY {a - a^)" (□"F)|„,+(i_„), da . 
Jo 

We substitute back into the original equation to obtain 

(2.5.24) = 6n,oSi,oV{x)S^''\x,y) 

+1 C a'-' (1 - aY {a - a^Y (□"^)|a,+(i-a). da S^^+'+^\x,y) 
Jo 

- Ca' (1 - aY+^ {a - a^Y (□"+V)|a,+(i-a). da S^^+'+'+'\x , y) 
Jo 

+n C a' (1 - a)^+2 _ ^2y-i (^ny^ S^^+'+'\x,y) . 

Jo 
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After dividing by n! and summing over n, the last two summands are telescopic and 
cancel each other. Thus we get 

oo „i 

oo 1 

= 5i,oV{x)S('\x,y) + ly^-J a^-^(l-ar(a-a2r 

n=0 

X da . (2.5.25) 

We now compare the formulas (2.5.23) and (2.5.25) for the Laplacian of both 
sides of (2.5.22). In the special case / = 0, these formulas coincide, and we can 
use a uniqueness argument for the solutions of the wave equation to prove (2.5.22): 
We assume that we consider the advanced Green's function (for the retarded Green's 
function, the argument is analogous). For given y, we denote the difference of both 
sides of (2.5.22) by F(x). Since the support of F{x) is in the past light cone x G Ly, F 
vanishes in a neighborhood of the hypersurface Ti = {z ^M!^\z^ = + 1}. Moreover, 
the Laplacian of F is identically equal to zero according to (2.5.23) and (2.5.25). We 
conclude that 

□ F = and = dkF\u = 0. 

Since the wave equation has a unique solution for given initial data on the Cauchy 
surface F vanishes identically. 

The general case follows by induction in /: Suppose that (2.5.22) holds for given 
I (and arbitrary r). Then, according to (2.5.23, 2.5.25) and the induction hypothesis, 
the Laplacian of both sides of (2.5.22) coincides for / = / + 1. The above uniqueness 
argument for the solutions of the wave equation again gives (2.5.22). ■ 



The above lemma can be used iteratively for the light-cone expansion of more 
complicated operator products. To explain the method, we look at the example of 
three factors S'^^^ and two potentials W, 

(5(0) W^5(°))(x,y) = j <fzS^'^\x,z)V{z) {S^'^'^W S^'^'^){ z, y) . (2.5.26) 

Having split up the operator product in this form, we can apply Lemma 2.5.2 to the 
factor S^^^WS^^\ 

= E^ / (^'^'5(°)(x,z)|y(z)^\a-aY(n"W^)K+(i-a).d«}5("+i^(^,y). 

Now we rewrite the 2:-integral as the operator product {S^^^ gyS^'^^){x, y), where gy{z) is 
the function in the curly brackets. The y-dependence of gy causes no problems because 
we can view y as a fixed parameter throughout. Thus we can simply apply Lemma 
2.5.2 once again to obtain 

A^^^rn\n\h Jo 

X nT [Viz] (□"W^)|..+(i-a).)u=,,+n_,). 5(-+-+2)(x,y) . 
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Now the Laplacian can be carried out with the Leibniz rule. Notice that the 
manipulations of the infinite series are not problematic because the number of terms 
is finite to every order on the light cone. 

The Feynman diagrams in (2.5.13) can be expanded with this iterative method, 
but they are a bit more difficult to handle. One complication is that pulling the Dirac 
matrices out of the Green's functions according to (2.5.12) gives one additional partial 
derivative per Green's function. However, this causes no major problems, because 
these partial derivatives can be carried out after each induction step by differentiating 
through the light-cone expansion of Lemma 2.5.2. Another issue is to keep track of 
the chirality of the potentials. To this end, one must use that the zero mass Green's 
function s and the factors -4-l/r odd, whereas the dynamical mass matrices are 
even. Thus the chirality of the potentials changes each time a dynamical mass matrix 
appears, as in the example of the operator product 

Xls4l^---^4l^^ls4rS---s4rsYrs4-l^-- - ■ (2.5.27) 

The last difficulty is that partial derivatives inside the line integrals may be contracted 
with a factor (y — x), like for example in the expression 

/ {y - xy djViay+(i-a)x da . 
Jo 

Such derivatives act in direction of the line integral and arc thus called tangential. 
Writing them as derivatives with respect to the integration variable, we can integrate 
by parts, e.g. 

J (y- xy djViay+(i-a)x da = J -j^V{ay + (1 - a)x) da = V{y) - V{x) . 

Going through the calculations and the combinatorics in detail, one finds that with 
such integrations by parts we can indeed get rid of all tangential derivatives, and one 
ends up with terms of th following structure (for the proof see [F6]). 

Theorem 2.5.3. (light-cone expansion of the k^^ order Feynman diagram) 

Using a multi-index notation and the abbreviation 

r[l, r I n] f{z) dz := / da a^ (1 - aY (a - a^)" f{ay + (1 - a)x) , (2.5.28) 

Jx Jo 

the light-cone expansion of the k*^ order contribution to the perturbation series (2.5.13) 
can be written as an infinite sum of expressions of the form 

XcC{y- x)^ wW(a;) r [h,r^ \ dz^ W^W(zi) f [l2,r2 \ ns] dz2 W^''\z2) 

J X J Z\ 

[la, Va I na] dz^ w(")(z„) S^^\x, y) , a<k. (2.5.29) 



Here the factors W^^^ are composed of the potentials and their partial derivatives, 

i^(/3) ^ {d^'^0n^'^0vi''^^l^^) . . . {d^''0n^>'Pv}^^^^^^^^^ (2.5.30) 

with ai = 1, ap^i = 6^ + 1, 6^ > O/j — 1 (in the case bp = ap — 1, W^(^) is identically 
equal to one) and b^ = k. Furthermore, c,Ca G {L,R} are chiral indices, C is a 
complex number, and the parameters la, ra, ria, and Pa are non-negative integers. The 
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functions Vj^\^ coincide with any of the individual potentials in (2.5.11) with chirality 
Ca, i.e. 

fJ"^ = Ac^ (in which case \Ja\ = l) or 

yia) ^ ^Y^^ (in which case \Ja\ = 0) . (2.5.31) 
The chirality Ca of the potentials is determined by the following rules: 

(i) The chirality ci of the first potential coincides with the chirality of the factor 

Xc- 

(a) The chirality of the potentials is reversed at every mass matrix, i.e. 



Ca and Ca+i 



coincide if Vc^^ = A^^ 
are opposite if V^^"^ = mYc^ 



The tensor indices of the multi-indices are all contracted with each other, according to 
the following rules: 

(a) No two tensor indices of the same multi-index are contracted with each other, 
(h) The tensor indices of the factor 7'^ are all contracted with different multi- 
indices. 

(c) The tensor indices of the factor {y — x)^ are all contracted with the tensor 

indices of the factors Vj^ or 7"^, but not with the partial derivatives 8^°^. 

To every order h on the light cone, the number of terms of the form (2.5.29) is finite. 
Furthermore, 

k 

2h = k-l-\K\-{- + 2pa) . (2.5.32) 

a=l 

The rules (i) and (ii) correspond precisely to our observation that the chirality 
changes at each dynamical mass matrix (2.5.27). The restrictions (a) and (b) on the 
possible contractions of tensor indices prevent an abuse of our multi-index notation. 
More precisely, (a) avoids factors {y — x)"^ in {y — x)^ , an unnecessary large number of 
7-matrices in j'^ and "hidden" Laplacians inside the partial derivatives d^^- The rule 
(b), on the other hand, prevents factors (y — x)^ and hidden Laplacians in combinations 
of the form {y — x)i (y — x)j 7* 7-' and dijVj^^ 7* 7-', respectively. The rule (c) means 
that no tangential derivatives appear. The rules (a)-(c) imply that the tensor indices 
of the multi-index K arc all contracted with the chiral potentials, except for one index 
which may be contracted with the factor 7"^. Since at most k chiral potentials appear, 
we get the inequality \K\ < A; + 1. Using this inequality in (2.5.32) we get the bound 

k 

h > -1 + JX^(|i^„| + 2p„). (2.5.33) 

^ a=l 

This shows that h never becomes smaller than —1 and that derivatives of the potentials 
increase the order on the light cone. In the case h = —1, it follows from (2.5.32) 
that \K\ > 1, meaning that at least one factor (y — x) appears in (2.5.29). We conclude 
that the factor S^*) in (2.5.29) is always well-defined by either (2.5.14) or (2.5.21). 

So far the Green's function was described perturbatively by a sum of Feynman di- 
agrams (2.5.13). In order to get from this perturbative description to non-perturbative 
formulas of the light-cone expansion, we shall now carry out the sum over all Feynman 
diagrams to any fixed order on the light cone. This procedure is called resummation of 
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the light-cone expansion. In order to give a first idea of how the resummation works, 
we consider the leading singularity on the light cone by neglecting all terms of the 
order 0{{y — x)~'^). According to (2.5.7), wc need to take into account only the con- 
tributions (2.5.29) with h = —1. The inequality (2.5.33) implies that no derivatives 
of the potentials appear. Moreover, we obtain from (2.5.32) that \K\ = k + 1. Again 
using the contraction rules (a) -(c), we conclude that one tensor index of the multi- 
index K is contracted with a Dirac matrix, whereas the remaining k indices of K 
are all contracted with chiral potentials. Therefore, all k potentials are chiral, and 
no dynamical mass matrices appear. A detailed calculation yields for the k*^ order 
Feynman diagram a term of precisely this structure, 

Xc{{-sBfs){x,y) = xci-i)" dzi {y - x)j, Ai^{z,) 

J X 

X / dz2 {y - zi)j2 {z2) ■■■ dzkiy- Zk)j^ A^^ (z^) s{x, y) 

+ 0((y-x)-2), 

where we used for the line integrals the short notation 

ry ry 



f 

J X 



ry n 
f{z) dz := / [0, I 0] f{z) dz = / f{ay + (1 - a)x) da . (2.5.34) 

Jx Jo 



I X J X 

The obtained nested line integrals can be identified with the summands of the familiar 
Dyson series. This allows us to carry out the sum over all Feynman diagrams, 

Xcs{x,y) = XcPe^P (^-i J\y - x)j Aiiz) dz^ s{x,y) + 0{{y - x)-^) , (2.5.35) 

where we again used the notation (2.5.34) and the following definition of Pexp. 

Dep. 2.5.4. For a smooth one-parameter family of matrices F{a), a G M, the 
ordered exponential Pexp(J' F{a) da) is given by the Dyson series 

fb \ fb fb i-b 

F{a) da\ = 1 + dto F{to) dto + / dto F{to) / dti F{ti) 

J J a J a J to 

+ j dto Fito) [ dti F{ti) j dt2 F{t2) + ■■■ . (2.5.36) 

The appearance of the ordered exponential in (2.5.35) can be understood from 
the local gauge invariance. We explain this relation for simplicity in the example of 
dynamical mass matrices and chiral potentials of the form 

Yl = Yr = 0, Al{x) = Ar{x) = iU{x) {dU-\x)) , 

where U = U^(^p^ is a unitary matrix on the sectors and generations. In this case, 
the Dirac operator is related to the free Dirac operator simply by a local unitary 
transformation, 

i^ + B = Ui^U-^ . 

Interpreting this local transformation as in §1.4 as a gauge transformation (1.4.3), we 
can say that the external potential can be transformed away globally by choosing a 
suitable gauge. Using the well-known behavior of the Green's function under gauge 
transformations, we obtain the simple formula 

~s{x,y) = U{x)s{x,y)U-\y). (2.5.37) 
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Let US verify that this is consistent with (2.5.35). Setting V{a) = U{ay + (1 — a)x) 
and using the relation V{V~'^y = {VV'^y - = -V'V~^, we can write the 

integrand of the ordered exponential as 

-i{y - x)j Ai{z) = -V'{a)V-^{a) . (2.5.38) 

Differentiating (2.5.36) with respect to a as well as evaluating it for a = b, one sees 
that the ordered exponential can be characterized as the solution of the initial value 
problem 

Ape.p (/^' f ) = _f („) Pexp (/J' f) , Pexp (/ f) = 1 . 

In the case F = —V'V~^, it is easily verified that the solution to this initial value 
problem is 

Pexp^-^ V\a)V-^{a)da^ = V{a)V-^{b). 

Using (2.5.38), we conclude that 

Pexp l^-i J\y - x)j Ai{z) dz^ = U{x) U-\y) . 

Thus the ordered exponential in (2.5.35) gives precisely the factor U (x) U{y)~^ needed 
for the correct behavior under gauge transformations (2.5.37). 

To higher order on the light cone, the situation clearly is more complicated. Nev- 
ertheless, it is very helpful to imagine that after rearranging the summands of the 
light-cone expansion in a suitable way, certain subseries can be summed up explicitly 
giving rise to ordered exponentials of the chiral potentials. As in (2.5.27), the chirality 
of the potentials should change each time a dynamical mass matrix appears. This 
conception is made precise by the following definition and theorem, proving that the 
light-cone expansion of the Green's function can be obtained to any given order on 
the light cone by taking a finite number of terms of the form (2.5.28) and inserting 
suitable ordered exponentials. 

Def. 2.5.5. A contribution (2.5.28) to the light-cone expansion of Theorem 2.5.3 
is called phase- free if all the tangential potentials Vj^^ are differentiated, i.e. 

\Ka\ + 2pa > whenever Ja is contracted with {y — x)^ . 

From every phase-free contribution the corresponding phase-inserted contribution is 
obtained as follows: We insert ordered exponentials according to the replacement rule 

(^^) ^ (^^) Pexp (^-i j'J^' 4', {zp+i -zp)^, (3 = 0,..., a, 

where we set zq = x and Za+i = y. The chiralities cp are determined by the rela- 
tions Co = c and 

contains an | ^^^^ number of factors y . 

Theorem 2.5.6. To every order on the light cone, the number of phase- free con- 
tributions is finite. The light-cone expansion of the Green's function s{x, y) is given 
by the sum of the corresponding phase-inserted contributions. 
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For the proof we refer to [F6]. In short, the first statement follows directly from the 
contraction rules (a)-(c) and (2.5.32), whereas for the second part one uses for fixed x 
and y the behavior of the Green's function under non-unitary local transformations of 
the spinors. 

Our constructions have led to a convenient procedure for performing the light-cone 
expansion of the Green's function. One only needs to compute to any order on the 
light cone the finite number of phase-free contributions. Then one inserts ordered ex- 
ponentials according to Def. 2.5.5. For the computation of the phase-free contributions 
we use a computer program. Appendix B lists those phase-free contributions which 
will be of relevance in Chapters 6-8. 

In the remainder of this section we describe how the above methods can be adapted 
to the fermionic projector. We begin for simplicity with the fermionic projector cor- 
responding to one Dirac sea in the vacuum (2.2.1). Similar to (2.5.4) we pull out the 
Dirac matrices, 

P^^^(x,y) = {i0.^ + m)T^2{x,y), (2.5.39) 
where T^2 is the Fourier transform of the lower mass shell, 

T^.{x,y) = / ^ik' - m') e(-fcO) e^^^^"^) . (2.5.40) 

Gomputing this Fourier integral and expanding the resulting Bessel functions gives 

T^,{x,y) = (£E + £(«")) 

Here ^ = y — x, Ce = 2C — 2 log 2 with Euler's constant C, and $ is the function 

n ^ 

$(0) = , $(n) = V T for n > 1 . 

k=l 

Similar to (2.5.6), this expansion involves distributions which are singular on the light 
cone. But in addition to singularities ~ S{{y — x)^ and ~ @{{y — x)^, now also 
singularities of the form form PP/(y — x)^ and log|(y — x)'^\ appear. In particular, 
T^2 is not causal in the sense that suppT^2(a;, .) <^ Lx- Another similarity to (2.5.6) 
is that power series in m^(y — x)^ appear. This suggests that in analogy to (2.5.7), 
the higher orders in m? should be of higher order on the light cone. However, due 
to the term log the distribution T„^2 is not a power series in m?. This means 

that the higher mass derivatives of T^2 do not exist, and the analog of (2.5.7) breaks 
down. This so-called logarithmic mass problem reflects a basic infrared problem in 
the light-cone expansion of the fermionic projector. In the vacuum, it can be resolved 
with the following simple construction. We subtract the problematic log |m^|-terms by 
setting 

Cl(x,y) = T^.{x,y) - loglm^l £ -[{^ • (2-5-42) 
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T^'^ = (-) Tr\=o a>0)- (2-5.43) 



This distribution is a power series in m^, and in analogy to (2.5.14) we can set 

Furthermore, we introduce T^~^^ similar to (2.5.21) as the distributional derivative 
of r(°). Similar to (2.5.7), 

r(") {x, y) is of the order 0{{y - xf ''-^) , 

and thus the mass expansion of T^f gives us its light-cone expansion. The point is 
that the difference of r^2 and T^f, 

T . -r- - ^ logim^i YJzl}^(-f^ 

is an absolutely convergent power series in and is thus a smooth function in position 
space. This smooth contribution is of no relevance as long as the singularities on the 
light cone are concerned. This leads us to write the fermionic projector in the form 



P--(x,y) = ^_^(i^^ + m)r(")(ar,2/) + P'^^ix^y) , (2.5.44) 

with 



n! 

n=0 



pic (^^^ ^ ^) ^T^^ _ J.-I) . 

The series in (2.5.44) is a light-cone expansion which completely describes the singu- 
lar behavior of the fermionic projector on the light cone. The so-called low-energy 
contribution P'^^ on the other hand, is a smooth function. 

This method of performing a light-cone expansion modulo smooth functions on 
the light cone also works for the general fermionic projector with interaction. But the 
situation is more complicated and at the same time more interesting, in particular be- 
cause the space-time dependence of the involved external potentials reveals the causal 
structure of the fermionic projector. The first construction step is to use the identity 
on the left side of (2.3.12) to carry over the light-cone expansion from the Green's 
function to the distribution k. Next, comparing (2.2.6) with the formula 

p^(fc) = {If + m) 5{k'^ - rr?) 
1 



iU + m) lim 
27ri £\o 



^2 _ j^2 _ ^g. — rn? + ie 



one sees that the distributions p and k differ from each only by the fe-regularization 
in momentum space. The key step of the construction is the so-called residual ar- 
gument, which relates the light-cone expansion of k to an expansion in momentum 
space. Using that the latter expansion remains unchanged if the poles of the distri- 
butions are suitably shifted in momentum space, one obtains the light-cone expansion 
for an operator p^^^, which can be regarded as a perturbation of p, but with a dif- 
ferent combinatorics than in the expansion of Theorem 2.3.1. More precisely, p can 
be obtained from p^^^ by replacing pairs of factors k in the perturbation expansion 
by corresponding factors p. The argument (2.4.2) shows that the difference p — p^^^ 
vanishes in the static case, and more generally one can say that p — p^^^ will be of 
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significance only if the frequency of the external potentials is of the order of the mass 
gap. Therefore, the operator 

P*^^ := ix(p-r^) 

is called the high-energy contribution to the fermionic projector. Moreover, resolving 
the logarithmic mass problem we obtain again a low- energy contribution P^^. We thus 
obain a representation of the fermionic projector of the form 

oo 

P^'^^{x,y) = ^ (phase-inserted line integrals) X r^"''(x, y) 

n=-l 

+P'%x,y)+P''%x,y). (2.5.45) 

Here the series is a light-cone expansion which describes the singular behavior of the 

fermionic projector on the light cone non-perturbativcly. It is obtained from the light- 
cone expansion of the Green's functions by the simple replacement rule 

In particular, the phase-inserted line integrals are exactly the same as those for the 
Green's functions (see Def. 2.5.5). The contributions P'^ and P^^, on the other hand, 
are given pcrturbatively by a series of terms which are all smooth on the light cone (we 
expect that the perturbation series for P'*^ and P'*^ should converge, but this has not yet 
been proven). The "causality" of the causal perturbation expansion can be understood 
from the fact that the phase-inserted line integrals in (2.5.45) are all bounded integrals 
along the line segment joining the points x and y (whereas the light-cone expansion 
of general operator products involves unbounded line integrals [Fl]). In particular, 
when y lies in the causal future or past of x, the light-cone expansion in (2.5.45) 
depends on the external potential only inside the "diamond" (J^ fl Jy) U (J^ fl Jy). 
Nevertheless, the light-cone expansion is not causal in the strict sense because there 
are contributions for y ^ Jx- Furthermore, the low- and high-energy contributions 
cannot be described with line integrals and violate locality as well as causality. This 
non-locality can be understood from the fact that the fermionic projector is a global 
object in space-time (see the discussion in §2.4). Mathematically, it is a consequence 
of the non-local operation of taking the absolute value of an operator (2.2.17) in the 
definition of the fermionic projector. We conclude that the singular behavior of the 
fermionic projector on the light-cone can be described explicitly by causal line integrals, 
whereas the smooth contributions to the fermionic projector are governed by non-local 
effects. 

Inspecting the explicit formulas of Appendix B, one sees immediately that from the 
line integrals of the light-cone expansion one can reconstruct the chiral and scalar /pseudo- 
scalar potentials. In this sense, P'^^^ encodes all information on the external potential. 
Furthermore, the fermionic projector gives via its representation (2.3.19) all informa- 
tion on the fermions and anti-fermions of the system. We thus come to the important 
conclusion that the fermionic projector describes the physical system completely. 

2.6. Normalization of the Fermionic States 

In §2.2 we disregarded that the fermionic states are in general not normalizable 
in infinite volume. We avoided this problem using a ^-normalization in the mass 
parameter (see e.g. (2.2.26)). In this section we will analyze the normalization in 
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detail by considering tlie system in finite volume and taking the infinite volume limit. 
Apart from justifying the formalism introduced in §2.2, this will clarify in which sense 
the fermionic projector is idempotcnt (Theorem 2.6.1). Furthermore, we will see that 
the probability integral has a well-defined infinite volume limit (Theorem 2.6.2), and 
this will also determine the normalization constant Cnorm in (2.3.19) (see (2.6.24)). We 
postpone the complications related to the chiral fermions to Appendix C and thus 
assume here that the chiral asymmetry matrix X = 1. Wc work again in the setting 
of §2.3 and assume that the external potential B satisfies the regularity assumptions 
of Lemma 2.2.2. Furthermore, we make the physically reasonable assumption that the 
masses are non- degenerate in the generations, meaning that 

Tiaa 7^ 'nT'ap for all a and a ^ ^ . (2.6.1) 

In order to ensure that all normalization integrals are finite, we need to introduce 
an infrared regularization. For clarity, we explain the construction for a single Dirac 
sea of mass m in the vacuum. First, we replace space by the three-dimensional box 

= [-h,h] X [-h, h] X [-k, k] with < < DO (2.6.2) 

and set V = |r^| = 8^1/2^3- We impose periodic boundary conditions; this means that 
we restrict the momenta k to the lattice given by 

= — X — CM^. 

h h h 

In order to carry over the operators Pm, and Sm (see §2.2) to finite volume, we leave 
the distributions in momentum space unchanged. In the transformation to position 
space, we replace the Fourier integral over 3-momentum by a Fourier series according 
to 

/ 14 - F E . (--3) 

When taking products of the resulting operators, we must take into account that the 
spatial integral is now finite. For example, we obtain that 

{PmPm'){x,y) = / Pm{x,z)pm'{z,y) d'^Z 

JMxT^ 

= f d'zT^^Y: ^-(^) ^-''^^"^ 

JmxT3 J-oo ^TT V ^ 

i-^Y.Pm'il)e^'' 2.6{k' -l')V 5^,^^ 
■~ reL3 

J-oo 27r V ^ 
= 6{m-m')pm{x,y) , (2.6.4) 
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where Pm{k) = {^+m)5{k'^ — m?'). More generally, the calculation rules (2.2.26-2.2.32) 
for products of the operators km, Pm and Sm remain valid in finite 3- volume. 

In (2.6.4) we are still using a ^-normalization in the mass parameter. In order to 
go beyond this formalism and to get into the position where we can multiply operators 
whose mass parameters coincide, we "average" the mass over a small interval [m, m+8\. 
More precisely, we set 



Pn 

Then 



rm+S pm+S 

- Pixdji and ^'^ ^ a kfj^dfi . (2.6.5) 

" Jm " Jm 



pm+S i-m+S 

PmPm = -H dfl dn' p^p^i 

Jm Jm 

= 72 / dn' 5{ii - n') pi^ ^ X2 / Ptidn = -zpm 

" Jm Jm " Jm " 



and thus, apart from the additional factor 5 ^, pm is idempotent. Similarly, we have 
the relations 

km km ~ ^ Pm and km Pm ~ Pm km ~ ^ km ■ 

Thus, introducing the infrared regularized fermionic projector corresponding to a Dirac 
sea of mass m by 

P'"" = ^iPm- km) , (2.6.6) 
this operator is indeed a projector, 

^psea^* = P^^^ and (psea-)2 ^ psea (2.6.7) 

The infinite volume limit corresponds to taking the limits hjhjh — ^ oo and (5 \ 0. 

Let us discuss the above construction. Clearly, our regularization method relies on 
special assumptions (3-dimensional box with periodic boundary conditions, averaging 
of the mass parameter). This is partly a matter of convenience, but partly also a 
necessity, because much more general regularizations would lead to unsurmountable 
technical difficulties. Generally speaking, infrared regularizations change the system 
only on the macroscopic scale near spatial infinity and possibly for large times. Due 
to the decay assumptions on the external potentials in Lemma 2.2.2, in this region the 
system is only weakly interacting. This should make infrared regularizations insensitive 
to the details of the regularization procedure, and it is reasonable to expect (although it 
is certainly not proven) that if the infinite volume limit exists, it should be independent 
of which regularization method is used. Here we simply take this assumption for 
granted and thus restrict attention to a special regularization scheme. But at least we 
will see that the infinite volume limit is independent of how the limits li oo and 
6 \ are taken. 

It is worth noting that not every infrared regularization has a well-defined in- 
finite volume limit. To see this, we consider the example of a regularization in a 
4-dimensional box. Restricting the time integral to the finite interval t G [— T, T], we 
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obtain 

{PmPm){x,y) 



k,leL^, k=l 



/_ 



e 

X (i^ + m) (/ + m) 5((A:°)2 - (^0)2) <5(jt2 _ 



and due to the factor |/c°|~^ in the last hne, this is not a multiple oi Pm{x,y). This 
problematic factor lA;'^!"^ also appears under more general circumstances (e.g. when we 
introduce boundary conditions at t = ibT and/or take averages of the mass parameter), 
and thus it seems impossible to arrange that the fermionic projector is idempotent. 
We conclude that a 4-dimensional box does not give a suitable regularization scheme. 

The mass averaging in (2.6.5) leads to the bizarre effect that for fixed k, a whole 
continuum of states of the fermionic projector, namely all states with 



-y^|fc|2 + (m + (5)2, -Y^|fc|2 + m2 



(2.6.8) 



are occupied. If one prefers to occupy only a finite number of states for every k, one 
can achieve this by taking the mass averages for the bra- and ket-states separately. 
For example, instead of (2.6.6) we could define the fermionic projector by 

J^j_^^^ E tm{k)Ul)e-''^^''' (2.6.9) 

with ijn = ^{Pm — km)- This fermionic projector is for every k composed of a finite 
number of states. Furthermore, it is a projector in the sense of (2.6.7). In con- 
trast to (2.6.6), (2.6.9) is not homogeneous in time, but decays on the scale t ~ S~^. 

However, if we restrict attention to a fixed region of space-time for which t <^ S~^, 
then (2.6.6) and (2.6.9) differ only by terms of higher order in 5, and therefore we 
can expect that (2.6.5) and (2.6.7) should have the same infinite volume limit. The 
definition (2.6.6) has the advantage that it will be easier to introduce the interaction. 

After these preparations, we come to the general construction of the fermionic 
projector in the three-dimensional box T"^. Since we want to "smear out" the mass 
similar to (2.6.5), the mass parameter needs to be variable. To this end, we introduce 
a parameter fi > which shifts all masses by the same amount. Thus we describe the 
system in the vacuum by the Dirac operator 

i^-mY-fil. (2.6.10) 

The external field is described by an operator B in the space-time R x T^, which we 
again insert into the Dirac operator, 

i^ + B - mY - nl . 
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Now we can introduce the operators p, k and their perturbation expansions exactly as 
in §2.3. For clarity, we denote the dependence on the parameter fi by an additional 

index In particular, we denote the operator products in Theorem 2.3.1 by 
and /c+jij. Since the multiplication rules (2.2.26-2.2.32) also hold in finite 3-volume, 
all the computations of §2.3 are still true. In particular, the operators and fc+j^ 
satisfy the (5-normalization conditions^ 

P+t^P+fj,' = = ^(pt - At')P+/^ (2.6.11) 

p+^k+^i = k+fj,p+fj,' = 6{fi - fi') k+^ . (2.6.12) 

In analogy to (2.6.5) and (2.6.6) we define the auxiliary fermionic projector by 

psea ^ 1 f\p^^-k^^)dli, (2.6.13) 

and the fermionic projector is again obtained by taking the partial trace (2.3.20), 

g(a) g(b) 

a=l p=l 

The next theorem shows that the fermionic projector is idempotent in the infinite 
volume limit, independent of how the limits — ^ oo and (5 \ are taken. 

Theorem 2.6.1. (idempotence of the fermionic projector) Consider a system 

composed of massive fermions with non-degenerate masses (2.6.1). Then the fermionic 
projector defined by (2.6.13) and (2.6.14) satisfies the relations 

N 

/ d'z Y^iPniix, z) {pnci^, y) = {Pnci^, y) + Qci^, y) , 

where Q has an expansion as a sum of operators which all have a well-defined infinite 
volume limit. 

Proof. For simplicity we omit the superscript 'sea'. It follows immediately 
from (2.6.11-2.6.13) that the auxiliary fermionic projector is idempotent, 

E^tt^? = (2.6.15) 

Thus it remains to show that 

EE E 47) 4t) = y) ■ (2.6.16) 

b o,7 /3,/3' with l3^/3' 

According to the non-degeneracy assumption (2.6.1), there are constants c,S > such 
that for all sufficiently small 6, 

\{mbi3 + m) - {mb/3> + > c for aU b, P ^ /?', and < /x, /x' < . (2.6.17) 



^Onl ine version: As noticed by A. Grotz, these relations are in general violated to higher order 

in perturbation theory. In order to cure the problem, one needs to rescale the states of the fermionic 
projector, as is worked out in the paper arXiv:0901.0334 [math-ph]. 
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On the left side of (2.6.16) we substitute (2.6.13) and the operator product expansions 
of Theorem 2.3.1. Using (2.6.17), the resulting operator products are all finite and can 
be estimated using the relations 

d^, f^d^i'i-.. A+^)[^;) [A^,, ...) = 0{8^) , (2.6.18) 

where each factor A stands for p, k or s. This gives (2.6.16). ■ 



At this point we can make a remark on the name "partial trace." The notion of a 
trace suggests that two matrix indices should be set equal and summed over; thus one 
may want to define the fermionic projector instead of (2.3.20) by 

This alternative definition suffers from the following problem. The off-diagonal el- 
ements of P^p-j 1 OL ^ fi^ are important to make the auxiliary fermionic projector 
idempotent, because 

(„a) (2.6^15) A ^ (6^3) in general » 

6=1 /3=i 6=1 

But these off-diagonal elements do not enter the definition (2.6.19), and this makes 
it difficult to arrange that P^ is idempotent. In more technical terms, defining the 
fermionic projector by (2.6.19) would in the proof of the above theorem lead instead 
of (2.6.16) to the conditions 

8 3 

/ p(»«) pibP) _ p{aa) p(6/3)\ _ .2 ^a/^ 

6=1 o,/3=l 

As a consequence, we would in (2.6.18) get contributions with /? = /?', which are 
singular. The only way to avoid these singular contributions would be to consider 
perturbations which are diagonal on the generations. But in this special case, also 
the auxiliary fermionic projector is diagonal on the generations, and so the defini- 
tions (2.6.19) and (2.3.20) coincide. We conclude that (2.3.20) is the more general and 
thus preferable definition of the partial trace. 

We next consider the normalization of the individual states of the fermionic pro- 
jector. In finite 3-volume in the vacuum, a Dirac sea of mass m is composed of a 
discrete number of fermionic states. More precisely, 



TT- 77 y (-^ + ^) Sik^ - m^) e(-A;°) e'^^^^^-^') 
27r V 

keL^ 

= y -^77^ m + m) e-^*^^^-^') (2.6.20) 

Here the image of [If, + m) is two-dimensional; it is spanned by the two plane-wave 
solutions of the Dirac equation of momentum k with spin up and down, respectively. 
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Thus we can write the fermionic projector in analogy to (2.2.1) as 

fcgL3 s=±l 

where are the suitably normalized negative-energy plane-wave solutions of the 
Dirac equation, and s denotes the two spin orientations. If an external field is present, it 
is still possible to decompose the fermionic projector similar to (2.6.21) into individual 
states. But clearly, each of these states is perturbed by B; we denote these perturbed 
states by a tilde. The next theorem shows that the probability integral for these states 
is independent of the interaction and of the size of T^. 

Theorem 2.6.2. (probability integral) Under the assumptions of Theorem 2.6.1, 
every state ^ of the fermionic projector is normalized according to 

[ ^^-f'^^y{t,x)dx = ^ . (2.6.22) 

Jrpa 27r 

Proof. Since * is a solution of the Dirac equation {i^ + B — mY — fil)'^ = 0, 
it follows from current conservation (see 1.2.16) that the probability integral (2.6.22) 
is time independent. Thus it suffices to compute it in the limits t ^ ±00, in which 
according to our decay assumptions on B the system is non-interacting. Since in the 
vacuum, the fermionic projector splits into a direct sum of Dirac seas, we may restrict 
attention to a single Dirac sea (2.6.21). Using that the probability integral is the same 
for both spin orientations, 

and comparing with (2.6.20) gives 

X3^*.-j7°*.-.K*,x)dx = -i-|^^-L,v(,0(^ + ^))|^^^_^^^,,^ 

1 f AhP _ 1 
i^TV Its m 



Let us discuss what this result means for the states of the fermionic projec- 
tor (2.6.13, 2.6.14). As pointed out in the paragraph of (2.6.8), the fermionic projector 
of the vacuum is composed for each /c € of a continuum of states (2.6.8). However, 
if we choose the space-time points in the fixed time interval — T < t < T and let 
(5 \ 0, we need not distinguish between the frequencies in (2.6.8) and obtain that only 

the discrete states with G -L^, = ~\J\k\^ + m? are occupied (see the discussion 
after (2.6.8)). In the causal perturbation expansion, each of these states is perturbed, 

and thus also the interacting fermionic projector for small 5 can be regarded as being 
composed of discrete states. We write in analogy to (2.6.21), 

P{x,y) = , 

a 
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where a runs over all the quantum number of the fermions. According to (2.6.13) and 
Theorem 1.5.5, the probability integral is 



By substituting the formulas of the light-cone expansion of §2.5 into (2.6.13), one 
sees that the contributions of the light-cone expansion to the fermionic projector all 
involve at least one factor S. Thus after rescaling P by a factor S~^, the probability 
integral (2.6.20) as well as the formulas of the light-cone expansion have a well-defined 
and non-trivial continuum limit. In particular, using that the particle and anti-particle 
states are to be normalized in accordance with the states of the sea, we can specify the 
normalization constant Cnorm in (2.3.19). If the wave functions and in (2.3.19) 
are normalized according to (1.2.15), we must choose 



We finally remark that Theorem 1.5.5 can be generalized in a straightforward way 

to include a gravitational field, if (2.6.22) is replaced by (1.5.22), with TC a space-like 
hypersurface with future-directed normal v. However, we need to assume that the 
gravitational field decays at infinity in the sense that space-time is asymptotically fiat 
and for t — >■ ±00 goes over asymptotically to Minkowski space. In particular, realistic 
cosmological models like the Friedman-Robertson- Walker space-times are excluded. 
We do not expect that the large-scale structure of space-time should have an influence 
on the normalization^, but this is an open problem which remains to be investigated. 



'Online version: This picture has been confirmed by the paper arXiv:0901.0602 [math-ph]. 




(2.6.23) 




(2.6.24) 



CHAPTER 3 



The Principle of the Fermionic Projector 

In this chapter we introduce a new mathematical framework intended for the for- 
mulation of physical theories. We first generalize the notions of relativistic quantum 
mechanics and classical field theory in several construction steps. This will be done 
in a very intuitive way. The aim is to work out the essence of the underlying physical 
principles by dropping all additional and less important structures. This will lead us 
to a quite abstract mathematical framework. The "principle of the fermionic projec- 
tor" states that the fundamental physical equations should be formulated within this 
framework. We conclude this chapter with a brief physical overview and discussion. 

3.1. Connection between Local Gauge Freedom and the 
Measurability of Position and Time 

In this section we give a possible explanation as to why local gauge freedom occurs 
in nature. This physical consideration will provide a formalism which will be the 
starting point for the constructions leading to the principle of the fermionic projector. 
We begin for simplicity with the example of the U{1) gauge transformations of the 
magnetic field for a Schrodinger wave function ^ in nonrelativistic quantum mechanics. 
Since it will be sufficient to consider the situation for fixed time, we only write out the 
spatial dependence of the wave function, ^ = ^(x) with x G M^. In the nonrelativistic 
and static limit, the gauge freedom of electrodynamics (1.4.1, 1.4.2) reduces to the 
transformations 

A{x) — > A{x) + VA{x) (3.1.1) 
*(f) e^^(^)*(x), (3.1.2) 

where the so-called vector potential A consists of the three spatial components of 
the electromagnetic potential A. Similar to (1.4.4), we introduce the gauge-covariant 
derivative by 

D = V-iA. (3.1.3) 

With the transformation (3.1.2) we can arbitrarily change the phase of the wave func- 
tion ^ at any point x. This is consistent with the quantum mechanical interpretation of 
the wave function, according to which the phase of a wave function is not an observable 
quantity, only its absolute square |^(x)p has a physical meaning as the probability 
density of the particle. One can even go one step further and take the point of view 
that the inability to determine the local phase of a quantum mechanical wave function 
is the physical reason for the local gauge freedom (3.1.1, 3.1.2). Then the U{1) gauge 
transformations of the magnetic field become a consequence of the principles of quan- 
tum mechanics. This argument becomes clearer when stated in more mathematical 
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terms: We consider on the Schrodinger wave functions the usual scalar product 

<^ I $> = / ^'(x) dx 
7ir3 

and denote the corresponding Hilbert space by H. On H, the position operators X 
are given as the multiplication operators with the coordinate functions, 

As it is common in quantum mechanics, we consider H as an abstract Hilbert space 
(i.e. we forget about the fact that H was introduced as a space of functions). Then 
the wave function '^{x) corresponding to a vector * G is obtained by constructing 
a position representation of the Hilbert space. In bra/ket notation, this is done by 
choosing an "eigenvector basis" |x> of the position operators, 

X \x> = x\x>, <x\y> = 5^{x-y), (3.1.4) 

and the wave function is then introduced by 

^{x) = <f|^'> (3.1.5) 

(we remark that the formal bra/ket notation can be made mathematically precise using 
spectral measures [F3]). The important point for us is that the "eigenvectors" \x> of 
the position operators are determined only up to a phase. Namely, the transformation 

\x> — > exp {-ik{x)) \x> (3.1.6) 

leaves invariant the conditions (3.1.4) for the "eigenvector basis." If we substitute 
(3.1.6) into (3.1.5), we obtain precisely the transformation (3.1.2) of the wave function. 
The transformation properties of the gauge-covariant derivative (3.1.3) and of the 
gauge potentials in (3.1.1) follow from (3.1.2) if one assumes that the gauge-covariant 
derivatives D are operators on H (and thus do not depend on the representation of 
H as functions in position space). In physics, the operators tt = —iD are called the 
"canonical momentum operators." 

The relation just described between the position representation of quantum me- 
chanical states and the U{1) gauge transformations of the magnetic field was noticed 
long ago. However, the idea of explaining local gaiigc freedom from quantum mechan- 
ical principles was not regarded as being of general significance. In particular, it was 
never extended to the relativistic setting or to more general gauge groups. The prob- 
able reason for this is that these generalizations are not quite straightforward; they 
make it necessary to formulate relativistic quantum mechanics in a particular way as 
follows. We again consider on the four-component Dirac spinors {^"{x))a=i,...,i in 
Minkowski space the spin scalar product (1.2.8) and denote the vector space of all 
Dirac wave functions by H. Integrating the spin scalar product over space-time, we 
obtain an indefinite scalar product on H, 

<^r I = / I ^y{x) d'^x . (3.1.7) 

ilR" 

Furthermore, we introduce on H time/position operators (-X'*)j=o,...,3 by multiplication 
with the coordinate functions, 

X' ^(x) = x' ^'(x) . 

We now consider (H, <.|.>) as an abstract scalar product space. In order to con- 
struct a time/position representation of H, we must choose an "eigenvector basis" 
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of the time/position operators. Since the wave functions have four components, an 
"eigenvector basis" has in bra/ket notation the form \xa>, x G M^, a = 1, . . . , 4; it is 
characterized by the conditions 

\xa> = x'' \xa> , <xa\y/3> = Sa S^{x — y) (3.1.8) 

with Sa as in (1.2.8). The wave function corresponding to a vector e H is defined 

by 

^'"(x) = <a;a|*>. (3.1.9) 

The conditions (3.1.8) determine the basis \xa> only up to local isometrics of a scalar 
product of signature (2, 2), i.e. up to transformations of the form 

4 

\xa> — > ^{U{x)-^)'^\xl3> with U{x)eU{2,2). (3.1.10) 
(3=1 

If we identify these transformations with gauge transformations and substitute into (3.1.9), 
we obtain local gauge freedom of the form 

*(x) — ^ U{x)^{x). (3.1.11) 

Since gauge transformations correspond to changes of the "eigenvector basis" |a;a>, 
we also refer to \xa> as a gauge. 

Prom the mathematical point of view, (3.1.8-3.1.10) is a straightforward gener- 
alization of (3.1.4-3.1.6) to the four-dimensional setting and four-eomponent wave 
functions, taking into account that the spin scalar product has signature (2,2). How- 
ever, our construction departs from the usual description of physics, because the time 
operator is not commonly used in relativistic quantum mechanics and because 
the scalar product (3.1.7) is unconventional. In particular, one might object that the 
scalar product (3.1.7) may be infinite for physical states, because the time integral 
diverges. However, this is not a serious problem, which could be removed for example 
by considering the system in finite 4- volume and taking a suitable infinite volume limit. 
Furthermore, one should keep in mind that the scalar product (3.1.7) gives us the spin 
scalar product, and using the spin scalar product one can introduce the usual posi- 
tive scalar product (.|.) by integrating over a space-like hypersurface (see (1.2.17) or 
more generally (1.5.22)). Therefore, it causes no principal problems to consider <.|.> 
instead of (.|.) as the fundamental scalar product. We conclude that (3.1.7-3.1.10) is 
certainly an unconventional point of view, but it is nevertheless consistent and indeed 
mathematically equivalent to the usual description of relativistic quantum mechanics 
as outlined in §1.2. 

The above explanation of local gauge freedom fits together nicely with our de- 
scription of Dirac spinors in the gravitational field in §1.5: We let {H, <.|.>) be the 
vector space of wave functions on a manifold AI, endowed with the indefinite scalar 
product (1.5.23). For every coordinate system we introduce the corresponding mul- 
tiplication operators X*. Considering H as an abstract vector space, the arbitrariness 
of the time/position representation of H again yields the local U{2,2) gauge free- 
dom (3.1.11). We thus obtain precisely the gauge transformations (1.5.2). In this way, 
(3.1.8-3.1.10) is not only consistent with all the constructions in §1.5, but it also gives 
a simple explanation for the gauge group U{2, 2). 

The U (2, 2) gauge symmetry describes gravitation and electrodynamics, but it 
does not include the weak and strong interactions. In order to obtain additional gauge 
freedom, we must extend the gauge group. Since our gauge group is the isometry group 
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of the spin scalar product, this can be accomphshed only by increasing the number of 
components of the wave functions. In general, one can take wave functions with p + q 
components and a spin scalar product of signature (p, q) , 

p+q 

^^\^y{x) = *"(a;)* with 

a=l 

Si = ■ ■ ■ = Sp = 1 , Sp+i = ■■■ = Sp+q = -1 . (3.1.12) 

We call {p,q) the spin dimension. Repeating the above construction (3.1.7-3.1.9) for 
this spin scalar product yields local gauge freedom with gauge group U{p, q). Unfortu- 
nately, it is not possible to introduce the Dirac operator in this generality. Therefore, 
we will always assume that the spin dimension is (2iV, 2N) with N > 1. In this case, 
one can regard the component wave functions as the direct sum of N Dirac spinors, 
exactly as we did in the general definition of the fermionic projector (2.3.1). Then our 
above argument yields the gauge group U{2N, 2N). The interaction can be described 
for example as in §2.3 by inserting a multiplication operator B into the Dirac opera- 
tor (2.3.10) and taking the partial trace (2.3.20). More generally, one can modify the 
first order terms of the Dirac operator in analogy to Def. 1.5.1. Our concept is that 
the U{2n,2n) gauge symmetry should be related to corresponding gauge potentials 
in the Dirac operator, and that this should, in the correct model, give rise to the 
gravitational, strong and electroweak gauge fields. 

For clarity, we finally point out the differences of our approach to standard gauge 
theories. Usually, the gauge groups (e.g. the SU{2)w or SU{3)s in the standard model) 
act on separate indices of the wave functions (called the isospin and color indices, 
respectively). In contrast to this, our [7(2,2) gauge transformations simply act on 
the spinor index. In our generalization to higher spin dimension (3.1.12), we make 
no distinction between the spinor index and the index of the gauge fields; they are 
both combined in one index a = 1, . . . ,4iV. In our setting, the gauge group and the 
coupling of the gauge fields to the Dirac particles are completely determined by the 
spin dimension. Compared to standard gauge theories, where the gauge groups and 
their couplings can be chosen arbitrarily, this is a strong restriction for the formulation 
of physical models. 

3.2. Projection on Fermionic States 

The fermionic projector was introduced in Chapter 2 in order to resolve the external 

field problem, and we used it to describe a general many-fcrmion system (2.3.19). We 
now discuss the concept of working with a "projector" in a more general context. A 
single Dirac particle is clearly described by its wave function '^°'{x) = <xa|^>, or, 
in a gauge-independent way, by a vector £ H. Since the phase and normalization 
of ^ have no physical significance, we prefer to describe the Dirac particle by the 
one-dimensional subspace <^'> = {A^*, A G C} C if. Now consider the system of n 
Dirac particles, which occupy the one-particle states G H. Generalizing 

the subspace <^'> of the one-particle system, it seems natural to describe the many- 
particle system by the subspace <5'i, . . . , 5'„> C H spanned by We 
consider for simplicity only the generic case that this subspace is non-degenerate (i.e. 
there should be no vectors / ^' G F with <*|$> = for ah ^ £ Y). Just 
as in positive definite scalar product spaces, every non-degenerate subspace Y C H 
uniquely determines a projector Py on this subspace, characterized by the conditions 
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Py = Py = Py Im(Py) = Y, where the star denotes the adjoint with respect to 
the scalar product <.|.>. Instead of working with the subspace <*i, . . . , *n> C H, 
it is more convenient for us to consider the corresponding projector P, 

P = i'<*i,...,5r„> . 

We call P the fermionic projector. In this work we will always describe the Dirac 
particles of the system by a fermionic projector. 

The concept of the fermionic projector departs from the usual description of a 
many-particle state by a vector of the fermionic Fock space (as introduced in §1.3). 
Let us discuss this difference in detail. In many-particle quantum mechanics, the 
system of Dirac particles ^i, . . . , ^'^ is described by the anti-symmetric product wave 
function 

= ^r^A---A*n- (3.2.1) 

The wave functions of the form (3.2.1) arc called n-particlc Hartree-Fock states. They 
span the ra-particle Fock space = /\" H. In the fermionic Fock space formalism, a 
quantum state is a linear combination of Hartree-Fock states, i.e. a vector of the Fock 
space F = (^^^ §^-3 foi' details). In order to connect the fermionic projector 

with the Fock space formalism, we can associate to a projector Py on a subspace 

Y = <^'i, . . . , ^'n> C H the wave function (3.2.1). This mapping clearly depends on 
the choice of the basis of Y. More precisely, choosing another basis $j = J2^=i '^j^j^ 
we have 

$1 A • • • A = det(K) A ■ ■ ■ A^n ■ 
This shows that, due to the anti-symmetrization, the mapping is unique up to a com- 
plex factor. Therefore, with the mapping 

^<*i,...,*„> ^ <^iA---A*„>CF" 

we can associate to every projector a unique one-dimensional subspace of the Fock 
space. Since the image of this mapping is always a Hartree-Fock state, wc obtain 
a one-to-one correspondence between the projectors Py on n-dimensional subspaces 

Y C H and n-particle Hartree-Fock states. In this way, one sees that the description of 
a many-particle state with the fermionic projector is equivalent to using a Hartree-Fock 
state. With this correspondence, the formalism of the fermionic projector becomes a 
special case of the Fock space formalism, obtained by restricting to Hartrcc-Fock states. 
In particular, we conclude that the physical concepts behind fermionic Fock spaces, 
namely the Pauli Exclusion Principle and the fact that quantum mechanical particles 
are indistinguishable (see page 17), arc also respected by the fermionic projector. How- 
ever, we point out that the the fermionic projector is not mathematically equivalent 
to a state of the Fock space, because a vector of the Fock space can in general be 
represented only by a linear combination of Hartree-Fock states. 

Let us analyze what this mathematical difference means physically. If nature is 
described by a fermionic projector, the joint wave function of all fermions of the Uni- 
verse must be a Hartree-Fock state. However, this condition cannot be immediately 
verified in experiments, because measurements can never take into account all exist- 
ing fermions. In all realistic situations, one must restrict the observations to a small 
subsystem of the Universe. As is worked out in Appendix A, the effective wave func- 
tion of a subsystem need not be a Hartree-Fock state; it corresponds to an arbitrary 
vector of the Fock space of the subsystem, assuming that the number of particles of 
the whole system is sufficiently large. From this we conclude that the description of 
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the many-particle system with the fermionic projector is indeed physically equivalent 
to the Fock space formalism. For theoretical considerations, it must be taken into 

account that the fermionic projector merely corresponds to a Hartree-Fock state; for 
all practical purposes, however, one can just as well work with the whole Fock space. 

We saw after (3.2.1) that the description of a many-particle state with the fermionic 
projector implies the Pauli Exclusion Principle. This can also be understood directly 
in a non-technical way as follows. For a given state E H, we can form the projector 
P<^> describing the one-particle state, but there is no projector which would cor- 
respond to a two-particle state (notice that the naive generalization 2P<^> is not a 
projector). More generally, every vector ^ £ H either lies in the image of P, ^ £ P{H), 
or it does not. Via these two conditions, the fermionic projector encodes for every state 
e H the occupation numbers 1 and 0, respectively, but it is impossible to describe 
higher occupation numbers. In this way, the fermionic projector naturally incorporates 
the Pauli Exclusion Principle in its formulation that each quantum mechanical state 
may be occupied by at most one fermion. 

As explained at the end of §2.5, the fermionic projector contains all the information 
about the physical system in the sense that from a given fermionic projector one can 
uniquely reconstruct the fermionic states as well as the Dirac operator with interaction. 
Therefore, it is consistent to consider the fermionic projector as the basic object in 
space-time and to regard the Dirac operator merely as an auxiliary object which is 
useful in describing the interaction of the fermions via classical fields. 

3.3. Discretization of Space-Time 

The ultraviolet divergences of perturbative QFT indicate that the current descrip- 
tion of physics should break down at very small distances. It is generally believed 
that the length scale where yet unknown physical effects should become important 
is given by the Planck length. Here we will assume that space-time consists on the 
Planck scale of discrete space-time points. The simplest way to discrctizc space-time 
would be to replace the space-time continuum by a four-dimensional lattice (as it is 
e.g. done in lattice gauge theories). In the following construction, we go much further 
and discretize space-time in a way where notions like "lattice spacing" and "neighbor- 
ing lattice points" are given up. On the other hand, we will retain the principles of 
general relativity and our local gauge freedom. 

We first consider the situation in a given coordinate system in space-time^. For 
the discretization we replace the time/position operators X' by mutually commut- 
ing operators with a purely discrete spectrum. We take the joint spectrum of these 
operators, i.e. the set 

M = {x G I there is u e H with X^u = x^u for all i = 0, . . . , 3} , 

as our discrete space-time points. We assume that the joint eigenspaces Sx of the X*, 

Cx = {u\ X^'u = x^tx for alH = 0, . . . , 3} , x e M , 



^We assume for simplicity that the chart a;' describes all space-time. The generalization to a 
non-trivial space-time topology is done in a straightforward way by gluing together different charts; 
for details see [F3]. 
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are 4A''-dimensional subspaces of H, on which the scalar product <.|.> has the signa- 
ture (2iV, 2N). Then we can choose a basis \xa>, x G M, a = 1,. . . , 4N satisfying 

X* \xa> = X* \xa> , <xa \ yP> = Sa Sa/s Sxy with 

si = • • • = S2Ar = 1 , S2Ar+i = • • • = S4Ar = -1 • (3.3.1) 

These relations differ from (3.1.8) only by the replacement S'^{x — y) — S^y It is useful 
to introduce the projectors Ex on the eigenspaces Cx by 

p+q 

Ex = Sa \xa><xa\ ; (3.3.2) 

a=l 

they satisfy the relations 

X'Ex = x' Ex and (3.3.3) 
E* = Ex , Ex Ey = Sxy Ex , ^2 = ^ , (3.3.4) 

x<=M 

where the star denotes the adjoint with respect to the scalar product <.|.> (these rela- 
tions immediately follow from (3.3.1) and the fact that \xa> is a basis). The operators 
Ex are independent of the choice of the basis \xa>, they are uniquely characterized 
by (3.3.3, 3.3.4) as the spectral projectors of the operators X*. 

If we change the coordinate system to = a;*(x), the discrete space-time points 
M C are mapped to different points in M^, more precisely 

M = x{M) , Ex^x) = Ex . (3.3.5) 

With such coordinate transformations, the relative position of the discrete space-time 
points in can be arbitrarily changed. Taking general coordinate invariance seriously 
on the Planck scale, this is consistent only if we forget about the fact that M and M 
are subsets of M"^ and consider them merely as index sets for the spectral projectors. 
In other words, we give up the ordering of the discrete space-time points, which is 
inherited from the ambient vector space M^, and consider M and M only as point 
sets. After this generalization, we can identify M with M (via the equivalence relation 
x{x) ~ x). According to (3.3.5), the spectral projectors (Ep)p^M are then independent 
of the choice of coordinates. 

We regard the projectors {Ep)p^M as the basic objects describing space-time. The 
time/position operators can be deduced from them. Namely, every coordinate system 
yields an injection of the discrete space-time points 

X : M , (3.3.6) 

and the corresponding time/position operators X* can be written as 

X' = Y,x\p)Ep. (3.3.7) 

Since every injection of the discrete space-time points into can be realized by a 

suitable choice of coordinates (i.e. for every injection /, : M there is a chart x* 

such that x(M) = t(M)), we can drop the condition that x is induced by a coordinate 
system. We can thus take for x in (3.3.6, 3.3.7) any embedding of M into M^. 

Let us summarize the result of our construction. We shall describe space-time by 
an indefinite scalar product space (i7, <.|.>) and projectors {Ep)p<zM on where M is 
a (finite or countable) index set. The projectors Ep are characterized by the conditions 
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(3.3.4). Furthermore, we assume that the spin dimension is (2A^, 2A''), i.e. Ep{H) C H 
is for all p G M a subspace of signature {2N,2N). We call {H, <.|.>, {Ep)p^M) 
discrete space-time. The equivalence principle is taken into account via the freedom 
in choosing the embeddings (3.3.6, 3.3.7) of the discrete space-time points. Moreover, 
one can choose a basis \pa>, p G M, a= 1, . . . , 4Ar, of H satisfying the conditions 

Ep \qa> = 6pq \pa> , <pa | qP> = Sa Safs 5pq 

with Sq as in (3.3.1); such a basis is called a gauge. It is determined only up to 
transformations of the form 

2N 

\pa> ^{U{p)-^ fp\pl3> with U{p) e U{2N,2N) . (3.3.8) 
(3=1 

These are the local gauge transformations of discrete space-time. 

3.4. The Principle of the Fermionic Projector 

For the complete description of a physical system we must introduce additional 
objects in discrete space-time {H, <.|.>, {Ep)p^M)- As mentioned at the end of §3.2, 
one can in the space-time continuum regard the fermionic projector as the basic phys- 
ical object. Therefore, it seems promising to carry over the fermionic projector to 
discrete space-time. We introduce the fermionic projector of discrete space-time P as 
a projector acting on the vector space H of discrete space-time. 

In analogy to the situation for the continuum, wc expect that a physical system 
can be completely characterized by a fermionic projector in discrete space-time. At 
this stage, however, it is not at all clear whether this description makes any physical 
sense. In particular, it seems problematic that neither the Dirac equation nor the 
classical field equations can be formulated in or extended to discrete space-time; thus 
it becomes necessary to replace them by equations of completely diflcerent type. We 
take it as an ad-hoc postulate that this can actually be done; namely we assert 

The Principle of the Fermionic Projector: 
A physical system is completely described by the fermionic projector 
in discrete space-time. The physical equations should be formulated 
exclusively with the fermionic projector in discrete space-time, i.e. 
they must be stated in terms of the operators P and {Ep)p^M on H. 

Clearly, the validity and consequences of this postulate still need to be investigated; 

this is precisely the aim of the present work. The physical equations formulated with 

P and {Ep)pQM are called the equations of discrete space-time. 

3.5. A Variational Principle 

Before coming to the general discussion of the principle of the fermionic projector, 
we want to give an example of a variational principle in discrete space-time. This is 
done to give the reader an idea of how one can formulate equations in discrete space- 
time. This example will serve as our model variational principle, and we will often 
come back to it. A more detailed motivation of our Lagrangian is given in Chapter 5. 

Let us first discuss the general mathematical form of possible equations in discrete 
space-time. The operators P and {Ep)p^M sll have a very simple structure in that 
they are projectors acting on H. Therefore, it is not worth studying these operators 
separately; for physically promising equations, we must combine the projectors P 
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and {Ep)p^M in a mathematically interesting way. Composite expressions in these 
operators can be manipulated using the idempotence of P and the relations (3.3.4) 
between the projectors {Ep)p,=M' First of all, the identities ^p^M^p ~ ^'^d ~ ^p 
allow us to insert factors Ep into the formulas; e.g. 

E^P^ = E^pl^Ey]^ = ^{E^PEy)Ey'^ . 

\yeM J yeM 

Writing 

P{x,y) = E^PEy, 

we obtain the identity 

E,{P^) = Y,P{x,y)Ey^. 

This representation of P by a sum over the discrete space-time points resembles the 
integral representation of an operator in the continuum with an integral kernel. There- 
fore, we call P(x, y) the discrete kernel of the fermionic projector. The discrete kernel 
can be regarded as a canonical representation of the fermionic projector of discrete 
space-time, induced by the projectors (£'p)peAf- Now consider a general product of 
the operators P and {Ep)p^M- Using the relations P^ = P and E^ Ey = 6xy E^, every 
operator product can be simplified to one with alternating factors P and Ep, i.e. to an 
operator product of the form 

Ex, P Ex, PEx,--- Ex„_, P Ex„ with Xj G M. (3.5.1) 

Again using that Ep = Ep, we can rewrite this product with the discrete kernel as 

P{xi,X2) P{X2,X3) ■■■ P{Xn-l,Xn). (3.5.2) 

We conclude that the equations of discrete space-time should be formed of products 
of the discrete kernel, where the second argument of each factor must coincide with 
the first argument of the following factor. We refer to (3.5.2) as a chain. 

In analogy to the Lagrangian formulation of classical field theory, wc want to set 
up a variational principle. Our "action" should be a scalar functional depending on 
the operators P and Ep. Most scalar functionals on operators (like the trace or the 
determinant) can be applied only to cndomorphisms (i.e. to operators which map a 
vector space into itself). The chain (3.5.2) is a mapping from the subspacc Ex,^ (H) C H 
to Ex,{H). This makes it difficult to form a scalar, unless xi = Xn- Therefore, we will 
only consider closed chains 

P{x,yi)P{yi,y2)---P{yk,x) : Ex{H) ^ Ex{H) . 

In the simplest case = 0, the closed chain degenerates to a single factor P{x,x). 
This turns out to be too simple for the formulation of a physically interesting action, 
mainly because the light-cone structure of the fermionic projector (see §2.5) would 
then not enter the variational principle. Thus we are led to considering closed chains 
of two factors, i.e. to the operator product P{x, y) P{y, x). Suppose that we are given a 
real- valued functional C on the cndomorphisms of Ex{H) C H (this will be discussed 
and specified below). Then C[P{x,y) P{y,x)] is a real function depending on two 
space-time arguments, and we get a scalar by summing over x and y. Therefore, we 
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take for our action S the ansatz 

S = i^[P{x,y)P{y,x)\. (3.5.3) 

This ansatz is called a two-point action, and in analogy to classical field theory we call 
C the corresponding Lagrangian. 

We shall now introduce a particular Lagrangian C The requirement which will 
lead us quite naturally to this Lagrangian is that C should be positive. Positivity of the 
action is desirable because it is a more convincing concept to look for a local minimum 
of the action than merely for a critical point of an action which is unbounded below. 

Let us first consider how one can form a positive functional on P{x, y) P{y, x). The 
closed chain P{x,y) P{y,x) is an endomorphism of Er^{H); we abbreviate it in what 
follows by A. In a given gauge, A is represented by a AN x AN matrix. Under gauge 
transformations (3.3.8), this matrix transforms according to the adjoint representation, 

A U{x)AU{x)-^ . 

Furthermore, A is Hermitian on Ex{H), i.e. 

<A^\^> = <^\A^> for G £/^(iy) , (3.5.4) 

or simply A* = A. In positive definite scalar product spaces, the natural positive 
functional on operators is an operator norm, e.g. the Hilbert-Schmidt norm ||-B||2 = 
tT{B*B)2 . In our setting, the situation is more difficult because our scalar product 
<.|.> is indefinite on Ex{H) (of signature (2N,2N)). As a consequence, Hermitian 
matrices do not have the same nice properties as in positive definite scalar product 
spaces; in particular, the matrix A might have complex eigenvalues, and it is in general 
not even diagonalizablc. Also, the operator product A* A need not be positive, so that 
we cannot introduce a Hilbert-Schmidt norm. In order to analyze the situation more 
systematically, we decompose the characteristic polynomial of A into linear factors 

K 

det(A- A) = JJ(A- Afe)"'= . (3.5.5) 

k=l 

This decomposition is useful because every functional on A can be expressed in terms 
of the roots and multiplicities of the characteristic polynomial; thus it is sufficient 
to consider the A^'s and n^'s in what follows. Each root Afc corresponds to an nk- 
dimensional A- invariant subspace of Ex{H), as one sees immediately from a Jordan 
representation of A. The roots A^ may be complex. But since A is Hermitian (3.5.4), 
we know at least that the characteristic polynomial of A is real, 

det(A - A) = det(A - A) for A G R. 

This means that the complex conjugate of every root is again a root with the same 
multiplicity (i.e. for every A^ there is a A; with A^ = A; and = ni). The reality 
of the characteristic polynomial is verified in detail as follows. In a given gauge, we 
can form the transposed, complex conjugated matrix of A, denoted by A^ . For clarity, 
we point out that A^ is not an endomorphism of Ex{H), because it has the wrong 
behavior under gauge transformations (in particular, the trace tr(74^^) depends on 
the gauge and is thus ill-defined). Nevertheless, the matrix A^ is useful because we 
can write the adjoint of A in the form A* = SA^ S, where 5" is the spin signature matrix. 
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S = diag((sa)a=i,...,4Ar)- Since 5^ = 1, and since the determinant is multiplicative, we 
conclude that for any real A, 

det(A - A) = det(A-At) = det{X -S^A^) 

= det(A - SA^S) = det(A - A*) = det(A - A) . 

It is worth noting that every Lagrangian is symmetric in the two arguments x and y, 
as the following consideration shows. For any two quadratic matrices B and C, we 
choose e not in the spectrum of C and set = C — el. Taking the determinant of the 
relation C^(BC^ — X) = {C^ B — X)C^ , we can use that the determinant is multiplicative 
and that det 7^ to obtain the equation det(SC^ — X) = det{C^B — A). Since both 
determinants are continuous in e, this equation holds even for all e G M, proving the 
elementary identity 

det(SC-Al) = det(C5-Al). (3.5.6) 
Applying this identity to the closed chain, 

detiP{x,y)P{y,x)-Xl) = det{P{y,x) P{x,y) - XI) , 

wc find that the characteristic polynomial of the matrix A remains the same if the two 
arguments x and y are interchanged. Hence 

C[P{x, y) P{y, x)] = C[P{y, x) P{x, y)] . (3.5.7) 

An obvious way to form a positive functional is to add up the absolute values of 
the roots, taking into account their multiplicities. We thus define the spectral weight 
\A\ of A by 

K 

\A\ = ^nk\Xk\. (3.5.8) 

fc=i 

This functional depends continuously on the Afe, and also it behaves continuously when 
the roots of the characteristic polynomial degenerate and the multiplicities Uk change. 
Thus the spectral weight | . | is a continuous functional. Furthermore, the spectral 
weight is zero if and only if the characteristic polynomial is trivial, det(A — A) = X*^ . 
This is equivalent to A being nilpotent (i.e. A'' = for some k). Thus, in contrast 
to an operator norm, the vanishing of the spectral weight does not imply that the 
operator is zero. On the other hand, it docs not seem possible to define an operator 
norm in indefinite scalar product spaces, and therefore we must work instead with the 
spectral weight. 

Using the spectral weight, one can write down many positive Lagrangians. The 
simplest choice would be C[A\ = \A\. Minimizing the corresponding action (3.5.3) 
yields a variational principle which attempts to make the absolute values of the roots 
|Afc| as small as possible. This turns out to be a too strong minimizing principle. It is 
preferable to formulate a variational principle which aspires to equalize the absolute 
values of all roots. This can be accomplished by combining the expressions [A^l and 
Namely, using that the sum of the multiplicities equals the dimension of the 
vector space, J2k=i ^fe = ^-^j Schwarz inequality yields that 

k=l \k=l / 
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and equality holds only if the absolute values of all roots are equal. Thus it is reasonable 
to minimize |^^|, keeping \A\'^ fixed. This is our motivation for considering the two- 
point action: 

minimize S = \{P{x,y) P{y,x))^\ (3.5.9) 

x,yeM 

under the constraint 

r := ^ \P{x,y) P{y,x)f = const. (3.5.10) 

x,yGM 

This is our model variational principle. 

We next consider a stationary point of the action and derive the corresponding 
"Euler-Lagrange equations." For simplicity, we only consider the case that P{x, y)P{y, x) 
can be diagonalized. This is the generic situation; the case of a non-diagonalizablc ma- 
trix can be obtained from it by an approximation procedure. Having this in mind, we 
may assume that the endomorphism A = P{x, y) P{y, x) has a spectral decomposition 
of the form 

K 

A = J2^kFk, (3.5.11) 

k=l 

where are the roots in (3.5.5), and the are operators mapping onto the corre- 
sponding eigenspaces {A, K, the \k, and the clearly depend on x and y, but we 
will, for ease in notation, usually not write out this dependence). Since the underlying 
scalar product space is indefinite, the spectral decomposition (3.5.11) requires a brief 
explanation. Suppose that we choose a basis where A is diagonal. In this basis, the 
operators F^. are simply the diagonal matrices with diagonal entries 1 if the corre- 
sponding diagonal elements of A are A^, and otherwise. Clearly, these operators 
map onto the eigenspaces and are orthonormal and complete, i.e. 

K 

AFk = Ajfc Ffe , FkFi = SkiFk and ^-Z^fc = ■ 

k=l 

However, the Fk arc in general not Hermitian (with respect to the spin scalar prod- 
uct). More precisely, taking the adjoint swaps the operators corresponding to complex 
conjugated eigenvalues, 

F^ = Fi when A^ = A, . (3.5.12) 

These relations can be understood immediately because they ensure that the spectral 
decomposition (3.5.11) is Hermitian, 

(K \ * K (3 5 12) 

^kPkj = ^ Afc F^ = ^ Afc Fk . 
k=l ) k=\ k=l 

Since the eigenvalues are in general complex, we can introduce a new matrix by taking 
the complex conjugate of the eigenvalues but leaving the spectral projectors unchanged, 

K 

A = Y,'^kFk (3.5.13) 

k=l 

We refer to A as the spectral adjoint of A. 

We now consider continuous variations P{t) and {Ep{T))p,zM, — e < r < e, of 
our operators. The structure of the operators must be respected by the variations; 
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this means that P(r) should be a projector and that the relations (3.3.4) between the 
operators {Ep)p^M should hold for all r. Continuity of the variation implies that the 
rank of P and the signature of its image do not change. This implies that the variation 
of P can be realized by a unitary transformation 

P(r) = C/(T)P^7(r)-^ (3.5.14) 

where U{t) is a unitary operator on H with U{0) = 1. Similarly, the variations of 
the projectors {Ep)p^M are also unitary. From (3.3.4) we can conclude the stronger 
statement that the variations of all operators (Ep)p^M can be realized by one unitary 
transformation, i.e. 

Epir) = V(T)EpV{T)-' 

with a unitary operator V{t) and V{0) = 1. Since our action is invariant under unitary 
transformations of the vector space H, we can, instead of unitarily transforming both 
P and {Ep)p^M, just as well keep the {Ep)pi^M fixed and vary only the fermionic 
projector by (3.5.14). To first order in r, this variation becomes 

SP = ^Pir)\r=o = i [B, P] , (3.5.15) 

where B = —iU'{0) is a Hermitian operator on H. We will only consider variations 
where B has finite support , i.e. where the kernel B{x,y) = E^ B Ey oi B satisfies the 
condition 

B{x, y) = except ioi x,y e N C M with #A/' finite. 

This condition can be regarded as the analogue of the assumption in the classical 
calculus of variations that the variation should have compact support. 

Let us compute the variation of the action (3.5.9) (the constraint (3.5.10) will be 
considered afterwards). Writing out the action with the eigenvalues and multiplic- 
ities rik, we obtain 

x,yeM k=l 

The variation can be computed in perturbation theory to first order, 

K 

5S = 2Re ^ ^A^tr(FfcM) 

x,yeM k=l 
K 

= 2Re 5ZA^tr(Ffe(5P(x,y)P(y,a;) + P(x,y)5P(y,x))), 

x,y€M k=l 

where "tr" denotes the trace in the vector space H. Exchanging the names of x and y 
in the first summand in the trace and using cyclicity of the trace, this expression can 
be written as an operator product, 

SS = 2Retr(Qi5P), (3.5.16) 

where the kernel Qi{x,y) = E^ Qi Ey of Qi has the form 



Qi{x,y) 



■ K 

.fe=l 



P{x,y) + P{x,y) 

xy 



K 



k=l 



(3.5.17) 

yx 
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and the subscripts ^^xy" and "yj;" indicate that the corresponding brackets contain the 
spectral decomposition of the operators P{x,y) P{y,x) and P{y,x) P{x,y), respec- 
tively. Note that the trace in (3.5.16) is well-defined because the trace is actually 
taken only over a finite-dimensional subspace of H. At this point the following lemma 
is useful. 

Lemma 3.5.1. Let B and C he two symmetric matrices and assume that their 
products A := BC and A := CB are both diagonalizable. Then they have the same 
eigenvalues with the same multiplicities ni,...,nK- The corresponding 

spectral projectors and F^ satisfy the relations 

FkB = BFk. (3.5.18) 

Proof. It immediately follows from (3.5.6) that the matrices A and A have the 
same eigenvalues with the same multiplicities. For any A not in the spectrum of A, we 
multiply the identity B{CB - A) = (BC - X)B from the left and right by {A - X)'^ 
and (^4 — A)~^, respectively. This gives 

{A - A)-^ B = B{A- X)~^ . 

Integrating A over a contour around any of the eigenvalues A^ and using the Cauchy 
integral formulas 

Ffe = --^ / (A-Xy^dX, Fk = (f {A-X)-^dX, 



27ri /aB,(Afc) 27ri j9B,{Xk) 

we obtain (3.5.18). 

This lemma allows us to simplify (3.5.17) to 



Qiix,y) = 2 



K 



.k=l 



P{x,y). (3.5.19) 

xy 



A short straightforward computation using (3.5.12) and Lemma 3.5.18 shows that the 
operator Qi is Hermitian. Thus the trace in (3.5.16) is real, and we conclude that 

SS = 2tr(Qi(5P) . 

The variation of our constraint (3.5.10) can be computed similarly, and one gets 

ST = 2tr{Q2SP) with 

^=1 k=l J xy 

Now consider a local minimum of the action. Handling the constraint with a 
Lagrange multiplier /x, we obtain the condition 

= SS-fi6T = 21t{{Qi-hQ2)5P) ^^-=^^ 2iiv{{Qi - nQ2)[B,P]) . 

Assume that the products (Qi — /x(32) P and P {Qi — IjlQ2) are well-defined operators. 
Since B has finite support, we can then cyclically commute the operators in the trace 
and obtain 

= 4itT{B [P, Qi-txQ2]). 
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Since B is arbitrary, we conclude that [P, Q\ — IJ.Q2] = 0, where our notation with the 
commutator imphcitly contains the condition that the involved operator products must 
be well-defined. Thus our Euler-Lagrange equations are the commutator equations 

2C^yP{x,y), (3.5.20) 
Fk . (3.5.21) 

xy 

In the formula (3.5.21) for Cxy, we consider the spectral decomposition (3.5.5, 3.5.11) 
of the closed chain P{x, y) P{y, x). The equations (3.5.20, 3.5.21) are the equations of 
discrete space-time corresponding to the variational principle (3.5.9, 3.5.10). 

3.6. Discussion 

In the previous sections the principle of the fermionic projector was introduced in 
a rather abstract mathematical way. Our constructions departed radically from the 
conventional formulation of physics, so much so that the precise relation between 
the principle of the fermionic projector and the notions of classical and quantum 
physics is not obvious. In order to clarify the situation, we now describe the general 
physical concept behind the principle of the fermionic projector and explain in words 
the connection to classical field theory, relativistic quantum mechanics and quantum 
field theory. Since we must anticipate results which will be worked out later, the 
description in this section is clearly not rigorous and is intended only to give a brief 
qualitative overview. 

The constructions in §3.1 and §3.2 are merely a reformulation of classical field 
theory and relativistic quantum mechanics. Although they are an important prepara- 
tion for the following construction steps, they do not by themselves have new physical 
implications. Therefore, we need not consider them here and begin by discussing the 
concept of discrete space-time of §3.3. With our definition of discrete space-time, the 
usual space-time continuum is given up and resolved into discrete space-time points. 
A-priori, the discrete space-time points are merely a point set without any relations 
(like for example the nearest-neighbor relation on a lattice) between them. Thus one 
may think of discrete space-time as a "disordered accumulation of isolated points." 
There exists no time parameter, nor does it make sense to speak of the "spatial dis- 
tance" between the space-time points. Clearly, this concept of a pure point set is too 
general for a reasonable description of space-time. Namely, we introduced discrete 
space-time with the intention of discretizing the space-time continuum on the Planck 
scale. Thus, for systems which are large compared to the Planck length, the discrete 
nature of space-time should not be apparent. This means that discrete space-time 
should, in a certain continuum limit, go over to a Lorentzian manifold. However, since 
M is merely a point set, discrete space-time {H, <.|.>, {Ep)p^M) is symmetric under 
permutations of the space-time points. Taking a naive continuum limit would imply 
that the points of space-time could be arbitrarily exchanged, in clear contradiction to 
the topological and causal structure of a Lorentzian manifold. 

In order to avoid this seeming inconsistency, one must keep in mind that we in- 
troduced an additional object space-time: the fermionic projector P. Via its discrete 
kernel P{x,y), the fermionic projector yields relations between the discrete space-time 
points. Our idea is that the discrete kernel should provide all structures needed for a 



[P,Q] = 



a 



xy 
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fc=l 



with Q\x,y) = 
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reasonable continuum limit. In more detail, our concept is as follows. In the space- 
time continuum (see Chapter 2), the fermionic projector is built up of all quantum 
mechanical states of the fermionic particles of the system. Closely following Dirac's 
original concept, we describe the vacuum by the "sea" of all negative-energy states; 
systems with particles and anti-particles are obtained by occupying positive-energy 
states and removing states from the Dirac sea, respectively. The fermionic projector 
of the continuum completely characterizes the physical system. In particular, its in- 
tegral kernel P(x, y) is singular if and only if y lies on the light cone centered at x. 
In this way, the fermionic projector of the continuum encodes the causal, and thus 
also topological, structure of the underlying space-time. We have in mind that the 
fermionic projector of discrete space-time should, similar to a regularization on the 
Planck scale, approximate the fermionic projector of the continuum. This means that 
on a macroscopic scale (i.e. for systems comprising a very large number of space-time 
points), the fermionic projector of discrete space-time can, to good approximation, 
be identified with a fermionic projector of the continuum. Using the just-mentioned 
properties of the continuum kernel, we conclude that the discrete kernel induces on 
discrete space-time a structure which is well-approximated by a Lorentzian manifold. 
However, on the Planck scale (i.e. for systems involving only few space-time points), 
the discrete nature of space-time becomes manifest, and the notions of space, time and 
causality cease to exit. 

The critical step for making this concept precise is the formulation of the phys- 
ical equations intrinsically in discrete space-time. Let us describe in principle how 
this is supposed to work. In the continuum description of Chapter 2, the fermionic 
projector satisfies the Dirac equation (2.3.10); furthermore the potentials entering the 
Dirac equation obey classical field equations. As a consequence of these equations, 
the fermionic projector of the continuum is an object with very specific properties. 
Our idea is that, using the special form of the fermionic projector, it should be possi- 
ble to restate the Dirac equation and classical field equations directly in terms of the 
fermionic projector. Thus we wish to formulate equations into which the fermionic 
projector enters as the basic object, and which are equivalent to, or a generalization 
of, both the Dirac equation and the classical field equations. It turns out that it is 
impossible to state equations of this type in the space-time continuum, because com- 
posite expressions in the fermionic projector are mathematically ill-defined. But one 
can formulate mathematically meaningful equations in discrete space-time, removing 
at the same time the ultraviolet problems of the continuum theory. The variational 
principle (3.5.9, 3.5.10) leading to the Euler-Lagrange equations (3.5.20, 3.5.21) is an 
example for such equations. Note that this variational principle and the corresponding 
Euler-Lagrange equations in discrete space-time are clearly not causal, but, for consis- 
tency with relativistic quantum mechanics and classical field theory, we demand that 
they should, in the continuum limit, reduce to local and causal equations (namely, to 
the Dirac and classical field equations). Since the fermionic projector is not an object 
which is commonly considered in physics, it is difficult to give an immediate physical 
interpretation for the equations of discrete space-time; only a detailed mathematical 
analysis can provide an understanding of the variational principle. If one wishes, one 
can regard the equations of discrete space-time as describing a direct particle-particle 
interaction between all the states of the fermionic projector. The collective interaction 
of the fermions of the Dirac sea with the additional particles and holes should, in the 
continuum limit, give rise to an effective interaction of fermions and anti- fermions via 
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classical fields. Ultimately, the collective particle-particle interaction should even give 
a microscopic justification for the appearance of a continuous space-time structure. 

Let us now describe the relation to quantum field theory. Since the coupled Dirac 
and classical field equations, combined with the pair creation/annihilation of Dirac's 
hole theory, yield precisely the Feynman diagrams of QFT (see e.g. [BDl]), it is clear 
that all results of perturbative quantum field theory, in particular the high precision 
tests of QFT, arc also respected by our ansatz (provided that the equations of discrete 
space-time have the correct continuum limit). Thus the only question is if the partic- 
ular effects of quantized fields, like the Planck radiation and the photo electric effect, 
can be explained in our framework. The basic physical assumption behind Planck's 
radiation law is that the energy levels of an electromagnetic radiation mode do not 
take continuous values, but are quantized in steps of E = hu). While the quantitative 
value hio of the energy steps can be understood via the quantum mechanical identifi- 
cation of energy and frequency (which is already used in classical Dirac theory), the 
crucial point of Planck's assumption lies in the occurrence of discrete energy levels. 
The photo electric effect, on the other hand, can be explained by a "discreteness" of 
the electromagnetic interaction: the electromagnetic wave tends not to transmit its 
energy continuously, but prefers to excite few atoms of the photographic material. We 
have the conception (which will, however, not be worked out in this book) that these 
different manifestations of "discreteness" should follow from the equations of discrete 
space-time if one goes beyond the approximation of an interaction via classical fields. 

If this concept of explaining the effects of quantized fields from the equations of 
discrete space-time were correct, it would even have consequences for the interpretation 
of quantum mechanics. Namely, according to the statistical interpretation, quantum 
mechanical particles are point-like; the absolute value |^'(x)p of the wave function 
gives the probability density for the particle to be at the position x. Here, we could 
regard the wave function itself as the physical object; the particle character would 
come about merely as a consequence of the "discreteness" of the interaction of the 
wave function with e.g. the atoms of a photographic material. The loss of determinism 
could be explained naturally by the non-causality of the equations of discrete space- 
time. 

We conclude that the principle of the fermionic projector raises quite fundamen- 
tal questions on the structure of space-time, the nature of field quantization and the 
interpretation of quantum mechanics. Before entering the study of these general ques- 
tions, however, it is most essential to establish a quantitative connection between the 
equations of discrete space-time and the Dirac and classical field equations. Namely, 
the principle of the fermionic projector can make physical sense only if it is consistent 
with classical field theory and relativistic quantum mechanics; thus it is of importance 
to first check this consistency. Even this comparatively simple limiting case is of high- 
est physical interest. Indeed, the principle of the fermionic projector provides a very 
restrictive framework for the formulation of physical models; for example there is no 
freedom in choosing the gauge groups, the coupling of the gauge fields to the fermions, 
or the masses of the gauge bosons. This means that, if a connection could be estab- 
lished to relativistic quantum mechanics and classical field theory, the principle of the 
fermionic projector would give an explanation for the interactions observed in nature 
and would yield theoretical predictions for particle masses and coupling constants. We 
begin with this study in the next chapters. 



CHAPTER 4 



The Continuum Limit 

According to the principle of the fermionic projector, we want to formulate physics 
with the fermionic projector P in discrete space-time {H, <.|.>, {Ep)p^M)- In this 
chapter we will establish a mathematically sound connection between this description 
and the usual formulation of physics in a space-time continuum. More precisely, we 
will develop a general technique with which equations in discrete space-time, like for 
example the Euler-Lagrange equations (3.5.20, 3.5.21), can be analyzed within the 
framework of relativistic quantum mechanics and classical field theory. Our approach 
is based on the assumption that the fermionic projector of discrete space-time can be 
obtained from the fermionic projector of the continuum by a suitable regularization 
process on the Planck scale. The basic difficulty is that composite expressions in the 
fermionic projector (like in (3.5.20)) depend essentially on how the regularization is 
carried out; our task is to analyze this dependence in detail. We will show that, if 
we study the behavior close to the light cone, the dependence on the regularization 
simplifies considerably and can be described by a finite number of parameters. Tak- 
ing these parameters as free parameters, we will end up with a well-defined effective 
continuum theory. 

We point out that, since we deduce the fermionic projector of discrete space-time 
from the fermionic projector of the continuum, the causal and topological structure 
of the space-time continuum, as well as the Dirac equation and Dirac's hole theory, 
will enter our construction from the very beginning. Thus the continuum limit cannot 
give a justification or even derivation of these structures from the equations of discrete 
space-time (for such a justification one must go beyond the continuum limit; see §5.6 
for a first attempt in this direction). The reason why it is nevertheless interesting to 
analyze the continuum limit is that we do not need to specify the classical potentials 
which enter the Dirac equation; in particular, we do not assume that they satisfy 
the classical field equations. Thus we can hope that an analysis of the equations 
of discrete space-time should give constraints for the classical potentials; this means 
physically that the equations of discrete space-time should in the continuum limit 
yield a quantitative description of the interaction of the Dirac particles via classical 
fields. This quantitative analysis of the continuum limit of interacting systems will be 
explained in Chapters 6-8. 

For clarity we will mainly restrict attention to a fermionic projector consisting of 
one Dirac sea of mass m. The generalizations to systems of fermions with different 
masses and to chiral fermions (as introduced in §2.3) are given in §4.5. Having gauge 
fields in mind, which are in quantum field theory described by bosons, we often refer 
to the external potentials contained in the operator B in the Dirac equation (2.3.10) 
as bosonic potentials and the corresponding fields as bosonic fields. 
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4.1. The Method of Variable Regular izat ion 

Let us consider how one can get a relation between the continuum fermionic projec- 
tor and the description of physics in discrete space-time. As discussed in §3.6, discrete 
space-time should for macroscopic systems go over to the usual space-time contin- 
uum. For consistency with relativistic quantum mechanics, the fermionic projector of 
discrete space-time should in this limit coincide with the continuum fermionic projec- 
tor. Using furthermore that the discretization length should be of the order of the 
Planck length, wc conclude that the fermionic projector of discrete space-time should 
correspond to a certain "regularization" of the continuum fermionic projector on the 
Planck scale. Thus it seems a physically reasonable method to construct the fermionic 
projector of discrete space-time from the fermionic projector of the continuum by a 
suitable regularization process on the Planck scale. 

Regularizations of the continuum theory are also used in perturbative QFT in or- 
der to make the divergent Feynman diagrams finite. However, there is the following 
major difference between the regularizations used in QFT and our regularization of 
the fermionic projector. In contrast to QFT, where the regularization is merely a 
mathematical technique within the renormalization procedure, we here consider the 
regularized fermionic projector as the object describing the physical reality. The reg- 
ularized fermionic projector should be a model for the fermionic projector of discrete 
space-time, which we consider as the basic physical object. As an important conse- 
quence, it is not inconsistent for us if the effective continuum theory depends on how 
the regularization is carried out. In this case, we must regularize in such a way that 
the regularized fermionic projector is a good microscopic approximation to the "physi- 
cal" fermionic projector of discrete space-time; only such a regularization can yield the 
correct effective continuum theory. This concept of giving the regularization a physical 
significance clearly suffers from the shortcoming that we have no detailed information 
about the microscopic structure of the fermionic projector in discrete space-time, and 
thus we do not know how the correct regularization should look like. In order to deal 
with this problem, we shall consider a general class of regularizations. Wc will ana- 
lyze in detail how the effective continuum theory depends on the regularization. Many 
quantities will depend sensitively on the regularization, so much so that they are unde- 
termined and thus ill-defined in the continuum limit. However, certain quantities will 
be independent of the regularization and have a simple correspondence in the contin- 
uum theory; we call these quantities macroscopic. We will try to express the effective 
continuum theory purely in terms of macroscopic quantities. We cannot expect that 
the effective continuum theory will be completely independent of the regularization. 
But for a meaningful continuum limit, it must be possible to describe the dependence 
on the regularization by a small number of parameters, which we consider as empiric 
parameters modelling the unknown microscopic structure of discrete space-time. We 
refer to this general procedure for constructing the effective continuum theory as the 
method of variable regularization. 

In order to illustrate the method of variable regularization, we mention an analogy 
to solid state physics. On the microscopic scale, a solid is composed of atoms, which 
interact with each other quantum mechanically. On the macroscopic scale, however, 
a solid can be regarded as a continuous material, described by macroscopic quantities 
like the density, the pressure, the conductivity, etc. The macroscopic quantities satisfy 
macroscopic physical equations like the equations of continuum mechanics. Ohm's law. 
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etc. Both the macroscopic characteristics of the sohd and the macroscopic physical 
laws can, at least in principle, be derived microscopically from many-particle quantum 
mechanics. However, since the details of the microscopic system (e.g. the precise form 
of the electron wave functions) are usually not known, this derivation often does not 
completely determine the macroscopic physical equations. For example, it may happen 
that a macroscopic equation can be derived only up to a proportionality factor, which 
depends on unknown microscopic properties of the solid and is thus treated in the 
macroscopic theory as an empirical parameter. The physical picture behind the method 
of variable regularization is very similar to the physics of a solid, if one considers on 
the microscopic scale our description of physics in discrete space-time and takes as 
the macroscopic theory both relativistic quantum mechanics and classical field theory. 
Clearly, the concept of discrete space-time is more hypothetical than atomic physics 
because it cannot at the moment be verified directly in experiments. But we can 
nevertheless get indirect physical evidence for the principle of the fermionic projector 
by studying whether or not the method of variable regularization leads to interesting 
results for the continuum theory. 

In the remainder of this section we will specify for which class of rcgularizations 
we shall apply the method of variable regularization. Our choice of the regularization 
scheme is an attempt to combine two different requirements. On one hand, we must 
ensure that the class of rcgularizations is large enough to clarify the dependence of the 
effective continuum theory on the regularization in sufficient detail; on the other hand, 
we must keep the technical effort on a reasonable level. Consider the integral kernel 
of the continuum fermionic projector (2.3.19, 2.5.45). Under the reasonable assump- 
tion that the fermionic wave functions and are smooth, the projectors on the 
particle/anti-particle states in (2.3.19) are smooth in x and y. The non-causal low- 
and high-energy contributions P^^ and P^*^ as well as the phase-inserted line integrals 
in (2.5.45) also depend smoothly on x and y. The factors T^^\ however, have singu- 
larities and poles on the light cone (sec (2.5.42) and (2.5.43)). Let us consider what 
would happen if we tried to formulate a variational principle similar to that in §3.5 
with the continuum kernel (instead of the discrete kernel). The just-mentioned smooth 
terms in the kernel would not lead to any difficulties; we could just multiply them with 
each other when forming the closed chain P{x,y) P{y,x), and the resulting smooth 
functions would infiuence the eigenvalues Afc(x, y) in (3.5.5) in a continuous way. How- 
ever, the singularities of T^"^ would cause severe mathematical problems because the 
multiplication of T^"'\x,y) with T^"'\y,x) leads to singularities which are ill-defined 
even in the distributional sense. For example, the naive product P{x, y) P{y, x) would 
involve singularities of the form ^ S'{{y — x)^) 5{{y — x)^) and ~ S{{y — x)^)^. This 
simple consideration shows why composite expressions in the fermionic projector make 
mathematical sense only after regularization. Furthermore, one sees that the regular- 
ization is merely needed to remove the singularities of T^"'\ Hence, it seems reasonable 
to regularize only the factors T^") in (2.5.45), but to leave the fermionic wave func- 
tions ^ a as well as the bosonic potentials unchanged. This regularization method 
implies that the fermionic wave functions and the bosonic potentials are well-defined 
also for the regularized fermionic projector; using the notation of page 90, they are 
macroscopic quantities. Therefore, we call our method of only regularizing T^") the 
assumption of macroscopic potentials and wave functions. 

The assumption of macroscopic potentials and wave functions means physically 
that energy and momentum of all bosonic fields and of each particle/anti-particle of 
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the physical system should be small compared to the Planck energy. In other words, 
we exclude the case that the physical potentials and wave functions have oscillations or 

fluctuations on the Planck scale. Namely, such microscopic inhomogeneities could not 
be described by smooth functions in the continuum limit and are thus not taken into 
account by our regularization method. If, conversely, the potentials and wave functions 
are nearly constant on the Planck scale, the unregularizcd and the (no matter by which 
method) regularized quantities almost coincide, and it is thus a good approximation 
to work in the regularized fcrmionic projector with the unregularizcd potentials and 
wave functions. 

According to the assumption of macroscopic potentials and wave functions, it re- 
mains to regularize the factors T^"''^ in (2.5.45). Recall that we constructed the dis- 
tributions T(") from the continuum kernel of the fermionic projector of the vacuum 
(2.2.1) via (2.5.39) and the expansion in the mass parameter (2.5.43). An essential step 
for getting a meaningful regularization scheme is to extend this construction to the 
case with regularization. Namely, this extension makes it sufficient to specify the regu- 
larization of the fermionic projector of the vacuum; we can then deduce the regularized 
r*^") and obtain, by substitution into (2.5.45), the regularized fermionic projector with 
interaction (if it were, on the contrary, impossible to derive the regularized T^^^ from 
the regularized fermionic projector of the vacuum, the independent regularizations of 
all functions T^"\ n = —1,0, 1,..., would involve so many free parameters that the 
effective continuum theory would be under-determined). Having in mind the extension 
of (2.5.39) and (2.5.43) to the case with regularization (which will be carried out in 
§4.5 and Appendix D), we now proceed to describe our regularization method for the 
fcrmionic projector of the vacuum. In the vacuTim and for one Dirac sea, the kernel of 
the continuum fermionic projector P(x,y) is given by the Fourier integral (2.2.2), 

P(x, y) = J + m) Sik"^ - n?) e(-fcO) e-^^^^-y) . (4.1.1) 

This distribution is invariant under translations in space-time, i.e. it depends only on 
the difference vector y — x. It seems natural and is most convenient to preserve the 
translation symmetry in the regularization. We thus assume that the kernel of the 
regularized fermionic projector of the vacuum, which we denote for simplicity again 
by P{x,y), is translation invariant, 

P{x,y) = P{y-x) for x,yeMcM.^. (4.1.2) 

We refer to (4.1.2) as a homogeneous regularization of the vacuum. Notice that the 
assumption (4.1.2) allows for both discrete and continuum regularizations. In the first 
case, the set M is taken to be a discrete subset of (e.g. a lattice), whereas in 
the latter case, M = M^. According to our concept of discrete space-time, it seems 
preferable to work with discrete regularizations. But since continuous regularizations 
give the same results and are a bit easier to handle, it is worth considering them too. 
The assumption of a homogeneous regularization of the vacuum means physically that 
the inhomogeneities of the fermionic projector on the Planck scale should be irrelevant 
for the effective continuum theory. Since such microscopic inhomogeneities can, at 
least in special cases, be described by microscopic gravitational or gauge fields, this 
assumption is closely related to the assumption of macroscopic potentials and wave 
functions discussed above. 
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Figure 4.1. Example for P, the regularized fermionic projector of the 
vacuum in momentum space. 

Taking the Fourier transform in the variable y — x, we write (4.1.2) as the Fourier 
integral 

Pix,y) = I P{P) e-'^^""'^ (4.1.3) 

with a distribution P. If one considers a discrete regularization, P may be defined 
only in a bounded region of M"^ (for a lattice regularization with lattice spacing d, for 
example, one can restrict the momenta to the "first Brillouin zone" p G ("■|)f)^)- 
In this case, we extend P to all by setting it to zero outside this bounded region. 
Although it will be of no relevance for what follows, one should keep in mind that for 
a discrete regularization, x and y take values only in the discrete set M. Let us briefly 
discuss the properties of the distribution P. First of all, P(x, y) should be the kernel 
of a Hermitian operator; this implies that P{x,y)* = P{y,x) and thus 

P{py = P(p) for ah p (4.1.4) 

(where the star again denotes the adjoint with respect to the spin scalar product). 
For consistency with the continuum theory, the regularized kernel (4.1.3) should, for 
macroscopic systems, go over to the continuum kernel (4.1.1). Thus we know that 
P{p) should, for small energy-momentum p (i.e. when both the energy p^ and the 
momentum \p\ arc small compared to the Planck energy), coincide with the distribution 
{]f + m) 6{p^ — rn?) Q{—p^). This is illustrated in the example of Figure 4.1. In the 
region I close to the origin, P looks similar to a hyperbola on the lower mass shell. 
Furthermore, we know that P is a regularization on the Planck scale. This means that, 
in contrast to the integrand in (4.1.1), P should decay at infinity, at least so rapidly 
that the integral (4.1.3) is finite for all x and y. The length scale for this decay in 
momentum space should be of the order of the Planck energy Ep = Ip . However, 
the precise form of P for large energy or momentum is completely arbitrary, as is 
indicated in Figure 4.1 by the "high energy region" II. This arbitrariness reflects our 
freedom in choosing the regularization. 

We finally make an ansatz for P which seems general enough to include all relevant 
regularization effects and which will considerably simplify our analysis. According to 
(4.1.4), P{p) is a Hermitian 4x4 matrix and can thus be written as a real linear 
combination of the basis of the Dirac algebra 1, ip, p'-f^ and o"-'^ (with the pseudoscalar 
matrix p = ij'^j^^^^^ and the bilinear covariants a^^ = | [7-' , 7^] ) . The integrand of the 
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continuum kernel (4.1.1) contains only vector and scalar components. It is reasonable 
to assume that the regularized kernel also contains no pseudoscalar and pseudovector 
components, because the regularization would otherwise break the symmetry under 
parity transformations. The inclusion of a bilinear component in P, on the other hand, 
would cause technical complications but does not seem to give anything essentially new. 
Thus we make an ansatz where P is composed only of a vector and a scalar component, 
more precisely 

Pip) = (vjip) 7^' + Hp) 1) f{p) (4.1.5) 
with a vector field v and a scalar field 0; / is a distribution. We also need to assume 
that P is reasonably regular and well-behaved; this will be specified in the following 
sections. We refer to the ansatz (4.1.5) as the assumption of a vector-scalar structure 
for the fermionic projector of the vacuum. 

4.2. The Regularized Product P{x,y) P{y,x) in the Vacuum 

According to the method of variable regularization, we must analyze how the 
effective continuum theory depends on the choice of the regularization. We shall now 
consider this problem for the simplest composite expression in the fermionic projector, 
the closed chain P{x,y) P{y,x) in the vacuum. The discussion of this example will 
explain why wc need to analyze the fermionic projector on the light cone. Working out 
this concept mathematically will eventually lead us to the general formalism described 
in §4.5. 

Using the Fourier representation (4.1.3), we can calculate the closed chain to be 

P{x, y) P{y, X) = / 1^ / 1^ ^(^i) P{k2) e-(^-'=^)(--) 



dp 



P{p + q) P{q) 



(2vr)4 

where we introduced new integration variables p = ki—k2 and q = k2- Thus the Fourier 
transform of the closed chain is given by the convolution in the square brackets. This 
reveals the following basic problem. The convolution in the square bracket involves P 
for small and for large energy-momentum. Even when p is small, a large q leads to 
a contribution where both factors P{p + q) and P{q) are evaluated for large energy- 
momenta. If we look at the example of Figure 4.1, this means that (4.2.1) depends 
essentially on the behavior of P in the high-energy region II and can thus have an 
arbitrary value. More generally, we conclude that, since the form of P for large energy 
or momentum is unknown, the value of (4.2.1) is undetermined. 

At first sight, it might seem confusing that the pointwisc product P{x, y) P{y, x) 
of the regularized fermionic projector should be undetermined, although the unregu- 
larized kernel (4.1.1) is, for y — x away from the light cone, a smooth function, and so 
pointwise multiplication causes no difficulties. In order to explain the situation in a 
simple example, we briefiy discuss the fermionic projector P obtained by adding to P 
a plane wave, 

P{x,y) = P{x,y) + e-^'^^^-^) 1. 

If the energy or the momentum of the plane wave is of the order of the Planck energy, 
the plane wave is highly oscillatory in space-time. Such an oscillatory term is irrelevant 
on the macroscopic scale. Namely, if P acts on a macroscopic function rj, the oscillatory 
term is evaluated in the weak sense, and the resulting integral J ex.p{iky) r]{y) d'^y gives 
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almost zero because the contributions with opposite signs compensate each other. This 
"oscillation argument" can be made mathematically precise using integration by parts, 
e.g. in the case of high energy ~ Ep, 

In the corresponding closed chain 

P{x,y)P{y,x) = P{x,y)P{y,x) + P{x,y) e-'''^y-^^ + e'^^^^^"^) P(y, x) + 1, 

the second and third summands are also oscillatory. In the last summand, however, 
the oscillations have dropped out, so that this term affects the macroscopic behavior 
of the closed chain. This elementary consideration illustrates why the unknown high- 
energy contribution to the fermionic projector makes it impossible to determine the 
closed chain pointwise. We remark that for very special regular izations, for example 
the regularization by convolution with a smooth "mollificr" function having compact 
support, the pointwise product makes sense away from the light cone and coincides 
approximately with the product of the unregularized kernels. But such regularizations 
seem too restrictive. We want to allow for the possibility that the fermionic projector 
describes non-trivial (yet unknown) high-energy effects. Therefore, the high-energy 
behavior of the fermionic projector should not be constrained by a too simple regular- 
ization method. 

The fact that the product P{x, y) P{y, x) is undetermined for fixed x and y docs 
not imply that a pointwise analysis of the closed chain is mathematically or physi- 
cally meaningless. But it means that a pointwise analysis would essentially involve 
the unknown high-energy behavior of P; at present this is a problem out of reach. 
Therefore, our strategy is to find a method for evaluating the closed chain in a way 
where the high-energy behavior of P becomes so unimportant that the dependence on 
the regularization can be described in a simple way. We hope that this method will 
lead us to a certain limiting case in which the equations of discrete space-time become 
manageable. 

The simplest method to avoid the pointwise analysis is to evaluate the closed chain 
in the weak sense. The Fourier representation (4.2.1) yields that 

I P{x, y) P{y, x) r,{x) d'x = fj{p) [| ^ P(;, + g) P(g)j , (4.2.2) 

where fj is the Fourier transform of a smooth function t]. For macroscopic rj (i.e. a 
function which is nearly constant on the Planck scale), the function r]{p) is localized 
in a small neighborhood oi p = and has rapid decay. Thus, exactly as (4.2.1), the 
integral (4.2.2) depends on the form of P for large energy-momentum. Hence this type 
of weak analysis is not helpful. In order to find a better method, we consider again 
the Fourier integral (4.1.3) in the example of Figure 4.1. We want to find a regime for 
y — X where the "low energy region"' I plays an important role, whereas the region II 
is irrelevant. This can be accomplished only by exploiting the special form of P in the 
low-energy region as follows. The hyperbola of the lower mass shell in region I comes 
asymptotically close to the cone C = {p^ = 0}. If we choose a vector (y — x) 7^ on the 
light cone L = {{y — x)^ = 0}, then the hypersurface H = {p\ p{y — x) = 0} is null and 
thus tangential to the cone C. This means that for all states on the hyperbola which 
are close to the straight line C CiTC, the exponential in (4.1.3) is approximately one. 
Hence all these states are "in phase" and thus yield a large contribution to the Fourier 
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integral (4.1.3). The states in the high-energy region II, however, are not in phase; 
they will give only a small contribution to (4.1.3), at least when the vector {y — x)eL 
is large, so that the exponential in (4.1.3) is highly oscillatory on the scale p ~ Ep. 
This qualitative argument shows that by considering the fermionic projector on the 
light cone, one can filter out information on the behavior of P in the neighborhood of 
a straight line along the cone C. This should enable us to analyze the states on the 
lower mass shell without being affected too much by the unknown high-energy behavior 
of P. We point out that if P{x, y) depends only on the behavior of P close to the 
cone C, then the same is immediately true for composite expressions like the product 
P{x,y) P{y,x). Thus restricting our analysis to the light cone should simplify the 
dependence on the regularization considerably, also for composite expressions like the 
closed chain. Our program for the remainder of this chapter is to make this qualitative 
argument mathematically precise and to quantify it in increasing generality. 

4.3. The Regularized Vacuum on the Light Cone, Scalar Component 

For simplicity we begin the analysis on the light cone for the scalar component of 
(4.1.5), i.e. we consider the case 

p(.p) = m fip) (4-3.1) 

(the vector component will be treated in the next section). We can assume that the 
spatial component of the vector y — x in (4.1.3) points in the direction of the x-axis 
of our Cartesian coordinate system, i.e. y — x = {t,r, 0, 0) with r > 0. Choosing 
cylindrical coordinates lo, k, p and if in momentum space, defined hy p = {io,p) and 
p={k,p cos ip, p sintp), the Fourier integral (4.1.3) takes the form 

1 roo roo roo i'2n 

Pi^^y) = T^r^ / dk pdp dipP{u,k,p,^)e"-'-'^' . (4.3.2) 

(27r)^ 7_oo J-oo Jo Jo 

Since the exponential factor in this formula is independent of p and (p, we can write 
the fermionic projector as the two-dimensional Fourier transform 

/oo /•oo 
dco / dkh{u,k)e''^^-'^'' (4.3.3) 
-oo J —oo 

of a function h defined by 

^^'^'^^ ^ 2 (27r)^ 7o ^^^i) d^ f^^^'^'P'^^ ■ (4.3.4) 

We want to analyze P{x, y) close to the light cone (y — x)^ = away from the origin 

y = X. Without loss of generality, we can restrict attention to the upper light cone 
t = r. Thus we are interested in the region t « r > 0. The "light-cone coordinates" 

s = ^{t-r), l = l{t + r) (4.3.5) 

arc wcll-suitcd to this region, bccaiise the "small" variable ,s vanishes for t = r, whereas 
the "large" variable / is positive and non-zero. Introducing also the associated momenta 

u = —k — uj , V = k — Lo , (4.3.6) 

we can write the fermionic projector as 

/oo poo 
du di;/i(u,i;)e-^(""+^') . (4.3.7) 
-oo J —oo 
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Figure 4.2. Example for h{u,v), the reduced two-dimensional distribution. 



Let us briefly discuss the qualitative form of the function h, (4.3.4). According 
to the continuum kernel (4.1.1), the scalar component (4.3.1) should, for energy and 
momentum small compared to the Planck energy Ep, go over to the 5-distribution on 
the lower mass shell P = m S{p^ — w?) Q{—p^). In this limit, the integral (4.3.4) can 
be evaluated to be 



h = I Pdp I d^6{u'-k'-p'-m')ei-uj) 



m 



^0 

2 7,2 2 



4(2vr)3®^'^ ^^ ^^^-'^^ = 32^®(^^-"' )®(^)5 (4.3.8) 

thus integrating out p and if yields a constant function in the interior of the two-di- 
mensional "lower mass shell" u'^ — k'^ = m^, a; < 0. Prom this we conclude that for 
u,v <^ Ep, h{u, v) should have a discontinuity along the hyperbola {uv = m?, u > 0}, 
be zero below (i.e. for uv < m^) and be nearly constant above. Furthermore, we know 
that h decays at infinity on the scale of the Planck energy. Similar to our discussion 
of P after (4.1.4), the precise form of h for large energy or momentum is completely 
arbitrary. The function h{u,v) corresponding to the example of Figure 4.1 is shown 
in Figure 4.2. The two branches of the hyperbola asymptotic to the u and v axes are 
labeled by "A" and "B," respectively. 

It is instructive to consider the energy scales of our system. The scale for high 
energies is clearly given by the Planck energy Ep. The relevant low-energy scale, 
on the other hand, is /Ep (it is zero for massless fermions). This is because the 
hyperbola uv = comes as close to the v-axis as as t; m^/£^p before leaving the 
low-energy region. These two energy scales are also marked in Figure 4.2. Since we 
want to analyze the situation close to the light cone, we choose the "small" light-cone 
parameter s on the Planck scale, i.e. 

s ~ Ep^ or s < Ep^ . (4.3.9) 

The "large" light-cone parameter I, on the other hand, is non-zero and thus yields a 
third energy scale. We shall always choose this scale between the two extremal energy 
scales, more precisely 

< / < /max < ^ . (4.3.10) 

Ep 
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The parameter Zmax was introduced here in order to avoid I being chosen too large. 
Namely, we will always regard I as being small compared to the length scales of macro- 
scopic physics (a reasonable value for Imax would e.g. be the Fermi length). One should 
keep in mind that the quotient of the two fundamental energy scales is in all physical 
situations extremely large; namely Ep/m^ » 10^^. Thus the constraints (4.3.10) can 
be easily satisfied and still leave us the freedom to vary / on many orders of magnitude. 

In the remainder of this section we shall evaluate the Fourier integral (4.3.7) using 
the scales (4.3.9) and (4.3.10). In preparation, we discuss and specify the function 
h{u, v) for fixed u, also denoted by hu{v). As one sees in Figure 4.2, hu will in general 
not be continuous. More precisely, in the example of Figure 4.2, /i„ has a discontinuous 
"jump" from zero to a finite value on the hyperbola (and its extension to the high- 
energy region) and maybe has a second jump to zero for large v (e.g. on line "a"). For 
simplicity, we assume that hu is always of this general form, i.e. 

' \ smooth for a„ < V < ^„ ^ ' 

with parameters a„ < The case of less than two discontinuities can be obtained 
from (4.3.11) by setting hu{oiu) or huiPu) equal to zero, or alternatively by moving the 
position of the discontinuities a„ or /3„ to infinity. We remark that the discontinuity 
at V = Pu will become irrelevant later; it is here included only to illustrate why 
the behavior of the fermionic projector on the light cone is independent of many 
regularization details. Without regularization, hu{v) is for v > au sl constant function, 
(4.3.8). Thus the v-dependence of hu{v) for au < v < Pu '^s merely a consequence 
of the regularization, and it is therefore reasonable to assume that the ^-derivatives 
of huiv) scale in inverse powers of the regularization length Ep. More precisely, we 
demand that there is a constant ci ^ lEp with 

|/ii"^(i;)| < max|^„| for a„ < ^; < ^„ , (4.3.12) 

where the derivatives at v = au and are understood as the right- and left-sided 
limits, respectively. This regularity condition is typically satisfied for polynomial, 
exponential and trigonometric functions, but it excludes small-scale fluctuations of 
hu- Clearly, we could also consider a more general ansatz for /i„ with more than two 
discontinuities or weaker regularity assumptions. But this does not seem to be the 
point because all interesting effects, namely the influence of discontinuities for small 
and large v, as well as of smooth regions, can already be studied in the setting (4.3.11, 
4.3.12). 

Let us analyze the i;-integral of the Fourier transform (4.3.7), 

/oo 
hu{v) e-'''' dv . (4.3.13) 
-oo 

According to the first inequality in (4.3.10), the exponential factor in (4.3.13) is highly 
oscillatory on the scale v ~ Ep. Thus we can expect that the smooth component of 
hu gives only a small contribution to the integral (4.3.13), so that the discontinuities 
at au and Pu play the dominant role. In order to make this picture mathematically 
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precise, we iteratively integrate in (4.3.13) K times by parts, 
Pu{l) = / K{v) e-'^^ dv = -- dv hu{v) —e 



-ivl 



K{v) e— ^ + -. h!^{v) e— ' dl = ■■■ = 

ll (Xu ll J 

-if (^)" "i-'W e-t: . (i)"£ e-« . (4.3.14) 



dv 



If we bound all summands in (4.3.14) using the first inequality in (4.3.10) and the 
regularity condition (4.3.12), each v-derivative appears in combination with a power 
of and this gives a factor ci/{lEp) <C 1. Thus we can in the limit if — ^ oo drop 
the integral in (4.3.14) and obtain 



^«(') = -aE(s) ftl-'We-'f^. (4.3.15) 

n=0 ^ ' 



This expansion converges, and its summands decay like {ci/{lEp))^ . 
Using (4.3.13), we can write the Fourier transform (4.3.7) as 

/oo 
Pu{l) e-^"" du . (4.3.16) 
-oo 

Notice that, apart from the constraints (4.3.10), the "large" variable I can be freely 
chosen. We want to study the functional dependence of (4.3.16) on the parameter I. 
In preparation, we consider an integral of the general form 

/ /(^x) e-^^(") ' , (4.3.17) 

J a 

where wc assume that (it, 7(11)) is a curve in the high-energy region, more precisely 
'J Ep. Assume furthermore that 7 is monotone with |7'| ~ 1 and that (b — a) ~ Ep. 
By transforming the integration variable, we can then write (4.3.17) as the Fourier 
integral 

nib) ^ . , 

/ iVr e"*^ ^^7 • (4.3.18) 

/7(a) 

If the function / h'l^^ is smooth, its Fourier transform (4.3.18) has rapid decay in 
the variable I. Under the stronger assumption that / j7'|^"^ varies on the scale Ep, 
we conclude that the length scale for this rapid decay is of the order / ~ Ep^ . As a 
consequence, the rapid decay can be detected even under the constraint I < /max im- 
posed by (4.3.10), and we say that (4.3.18) has rapid decay in I. The reader who feels 
uncomfortable with this informal definition can immediately make this notion mathe- 
matically precise by an integration by parts argument similar to (4.3.14) imposing for 
/ 17'|~^ a condition of type (4.3.12). The precise mathematical meaning of rapid decay 
in I for the integral (4.3.17) is that for every integer k there should be constants c ~ 1 

and /min < ^max SUch that for ah I G (/min, /max), 

f f{u)e-''^^''^^ du < c{lEp)-^ f\f{u)\du. 

J a J a 



f 
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We return to the analysis of the integral (4.3.16). The boundary terms of (4.3.15) 
at Pu yield contributions to P{s, I) of the form 

/ 1 \ "+1 roc 

-f-j J h^^\l5u) e-'l^-^-'''' du . (4.3.19) 

Recall that the points (u, /?«) arc in the high-energy region (in the example of Fig- 
ure 4.2, these points lie on curve "a"). According to (4.3.9), the length scale for the 
oscillations of the factor exp(— itts) is u ~ Ep. Under the reasonable assumption that 

j3u is monotone and that the functions |/3'(u)|~^ and h^u\Pu) vary on the scale Ep, 
the integral (4.3.19) is of the form (4.3.18), and the above consideration yields that 
(4.3.19) has rapid decay in I. We remark that this argument could be extended to the 
case where /5„ has extremal points (basically because the extrema give contributions 
only for isolated momenta u and thus can be shown to be negligible), but we will 
not go into this here. Having established rapid decay in I for (4.3.19), it remains to 
consider the boundary terms in (4.3.19) at more precisely 

+ (rapid decay in I) . (4.3.20) 

We cannot again apply our "oscillation argument" after (4.3.17), because tends 
asymptotically to zero on branch "A" of the hyperbola (see Figure 4.2), so that the 
factor exp(— zOuZ) is non-oscillating in this region. We expand this factor in a Taylor 
series, 

°° 1 roo 

P{s,l) = E ry (i/)'-"-' / ^i"H«u) (-««)' e--^ du . (4.3.21) 

n,k=0 

In the region where lau ^ 1, this expansion might seem problematic and requires a 
brief explanation. First of all, becomes large near n = (on branch "B" of the 
hyperbola in Figure 4.2). In the case without regularization, the power expansion 
of the factor exp(— I) corresponds to an expansion in the mass parameter (recall 
that in this case, = m? /u according to (4.3.8)), and in (4.3.21) it would lead to 
a singularity of the integrand at the origin. Indeed, this difficulty is a special case 
of the logarithmic mass problem which was mentioned in §2.5 and was resolved by 
working with the "regularized" distribution Ta^^ (2.5.42). Using these results, the 
behavior of the unregularized P{s,l) for small momenta u <^ Ep is well understood. 
Our oscillation argument after (4.3.17) yields that the regularization for u <C Ep (i.e. 
the form of the extension of branch "B" of the hyperbola to the high-energy region) 
affects P{s, I) merely by rapidly decaying terms. Thus it is sufficient to consider here 
the integrand in (4.3.21) away from the origin n = 0. When combined with the results 
in §2.5, our analysis will immediately yield a complete description of the regularized 
fermionic projector near the light cone. Furthermore, the function a„ might become 
large for u ~ Ep, and this is a more subtle point. One way of justifying (4.3.21) 
would be to simply assume that Imax-O^u ^ 1 along the whole extension of branch 
"A" to the high-energy region. A more general method would be to split up the 
curve (n, a„) in the high-energy region u ~ Ep into one branch where the expansion 
(4.3.21) is justified and another branch where our oscillation argument after (4.3.17) 
applies. The intermediate region lau ~ Ij where none of the two methods can be 



4.3. THE REGULARIZED VACUUM ON THE LIGHT CONE, SCALAR COMPONENT 101 



used, is generically so small that it can be neglected. In order to keep our analysis 
reasonably simple, we here assume that is sufficiently small away from the origin, 
more precisely 

OLu < "max < iovu^-Ep. (4.3.22) 



For a fixed value oi k — n, all summands in (4.3.21) have the same /-dependence. 

Let us compare the relative size of these terms. According to our regularity assumption 
(4.3.12), the derivatives of h scale like h!>a^ ~ Ep''. Using the bound (4.3.22), we con- 
clude that, for a fixed power of I, the summands in (4.3.21) decrease like (amaxZ-Ep)". 
Thus it is a very good approximation to drop the summands for large n. At first sight, 
it might seem admissible to take into account only the first summand n = 0. But the 
situation is not quite so simple. For example, it may happen that, when restricted 
to the curve («,««), the function h{u,v) is so small that the summands for n = 
in (4.3.21) are indeed not dominant. More generally, we need to know that for some 

no > 0, the function h^°\au) is really of the order given in (4.3.12), i.e. 

-^j max|/i„| for u-^Ep (4.3.23) 

with a positive constant c which is of the order one. If this condition is satisfied, we 
may neglect all summands for n > no, and collecting the terms in powers of Z, we 
conclude that 



P{s,l) 



k=Q n=max(no— fc,0) 



+ Yl cnn+i / ht\au) e-^"^ du + (rapid decay in I) 

+ (higher orders in {amax/ Ep) . (4.3.24) 



We point out that, according to (4.3.22), 

Olmax/Ep <^ (ImaxEp) 



and this explains why we disregard the higher orders in a^sx/Ep. In our case, the 
function hu has in the low-energy region according to (4.3.8) the form hu{ctu) = 
m/(327r"^) Q{u). Hence it is natural to assume that (4.3.23) is satisfied for no = 0. 
Introducing the shorter notation 

h{u) := Kiaiu)), .= /tH(a„), a{u) := a„ , (4.3.25) 

we have thus derived the following result. 

Expansion of the scalar component: Close to the light cone (4-3.9, 4-3.10), the 
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scalar component (4-3.1) of the fermionic projector of the vacuum has the expansion 
P(s,l) = -^i^l^ / ha^e-'^^'du (4.3.26) 



k=0 

+ (rapid decay in I) + (higher orders in {cti-aax/ Ep) (4.3.28) 

with suitable regularization functions h, /it"! and a. In the low-energy region u <C Ep, 
the regularization functions are 

M«) = /iM(«) = 0, a{u) = au = —. (4.3.29) 

In this expansion, the Z-dependence is written out similar to a Laurent expansion. 
The main simphfication compared to our earher Fourier representation (4.1.3) is that 
the dependence on the regularization is now described by functions of only one variable, 
denoted by h, /il"! and a. In composite expressions in P{s,l), we will typically get 
convolutions of these functions; such one-dimensional convolutions are convenient and 
can be easily analyzed. The simplification to one-dimensional regularization functions 
became possible because many details of the regularization affect only the contribution 
with rapid decay in I, which we do not consider here. Notice that the summands in 
(4.3.26) and (4.3.27) decay like {I araa^)'' / k\ < {I / Ira^y^f / k\ and {lEp)"^, respectively. 
In the low-energy limit (4.3.29), the expansion (4.3.26) goes over to a power series 
in m^, and we thus refer to (4.3.26) as the mass expansion. In the mass expansion, 
the regularization is described by only two functions h and a. The scries (4.3.27), 
on the other hand, is a pure regularization effect and is thus called the regularization 
expansion. It involves an infinite number of regularization functions /i'"' . Accordingly, 
we will use the notions of mass and regularization expansions also for other expansions 
of type (4.3.24). 

In the expansion (4.3.24), the fermionic projector is described exclusively in terms 
of the function /i(n, v) in a neighborhood of the discontinuity along the curve {u, «„)• 
Let us go back to the definition of h, (4.3.4), and consider what this result means for 
the regularized fermionic projector in momentum space (4.3.1). In the case without 
regularization (4.3.8), we saw that integrating out the cylindrical coordinates p and if 
yields a discontinuity of h whenever the 2-plane (a;, A;) = const meets and is tangential 
to the hyperboloid lxP' — Iz^ — (P' = rr? . This picture is true in the general case in 
the sense that the discontinuity of h can be associated to a contribution to P which 
describes a hypersurface in four-dimensional momentum space. The simplest way to 
recover the discontinuity of h when integrating out the cylindrical coordinates would 
be to choose P of the form (4.3.1) with a function and the spherically symmetric 
distribution / = 5(|p| — w — a(— |p| — w)). Since spherically symmetric regularizations 
seem too restrictive, it is preferable to describe the discontinuity of h more generally 
by a contribution to P of the form 

m 5(u; - 0(p)) , (4.3.30) 

which is singular on the hypersurface u = ^{p). For small momentum \p\ <C Ep, 
the surface should clearly go over to the mass shell given hy Q = —\/\p\'^ + m"^ and 
(p = m/\2n\; also, it is reasonable to assume that (p and O are smooth and sufficiently 



4.3. THE REGULARIZED VACUUM ON THE LIGHT CONE, SCALAR COMPONENT 103 

regular. This consideration shows that for the behavior of the fermionic projector on 
the hght cone (4.3.24), the essential role is played by states lying on a hypersurface. 

We refer to these one-particle states as the surface states of the fermionic projector 
of the vacuum. This result seems physically convincing because the surface states 
naturally generalize the states on the lower mass shell known from relativistic quantum 
mechanics. By integrating out the cylindrical coordinates for the ansatz (4.3.30), 
one can express the regularization functions /li"^ in (4.3.24) in terms of (j) and the 
geometry of the hypersurface. But we point out that, in contrast to the just discussed 
discontinuity of h, the partial derivatives of h depend also on states other than surface 
states. For example, a contribution to P of the form h{u,p) 0(a; — ^{p)) with $7 as in 
(4.3.30) and a smooth function h has a discontinuity on the surface and affects all 

the regularization functions /il"^ for n > 1 (as one verifies by a short computation). 
Thinking of the decomposition of the fermionic projector into the one-particle states, 
such non-surface contributions would consist of a large number of states and would thus 
make it necessary to introduce many additional fermions into our system. It does not 
seem quite reasonable or appropriate to considerably increase the number of particles 
of the system with the only purpose of having more freedom for the derivative terms 
of h in (4.3.24). It seems easiest and physically most convincing to assume that all the 
regularization functions in (4.3.24) come about as a consequence of surface states. We 
refer to this assumption as the restriction to surface states. It is of no relevance for 
the scalar component (4.3.26, 4.3.28), but it will yield an important relation between 
the regularization functions for the vector component in the next section. To avoid 
confusion, we point out that the restriction to surface states clearly does not imply 
that P is of the form (4.3.30). It imposes a condition only on the behavior of P in a 
neighborhood of our hypersurface; namely that the only distributional or non-regular 
contribution to P in this neighborhood should be the hypersurface itself. 

For clarity, we finally review our assumptions on the regularization. Our first 
assumption was that the function h{u, v) has, for every fixed u, at most two disconti- 
nuities at a{u) and /3(u), and is sufficiently regular otherwise (4.3.12). Furthermore, 
the function f5{u) had to be monotone and again sufficiently regular. For the func- 
tion a{u), we assumed that (4.3.22) holds. Since h is obtained from P, (4.3.1), by 
integrating out the cylindrical coordinates (4.3.4), these assumptions implicitly pose 
conditions on the fermionic projector of the vacuum. Although they could clearly be 
weakened with more mathematical effort, these conditions seem sufficiently general for 
the moment. In order to understand this better, one should realize that integrating out 
the cylindrical coordinates does generically (i.e. unless there are singularities parallel 
to the plane [to, k) = const) improve the regularity. The restriction to the generic case 
is in most situations justified by the fact that the direction y — x and the coordinate 
system in (4.3.2) can be freely chosen. Using the above assumptions on h{u,v), we 
showed that the dominant contribution to the fermionic projector on the light cone 
comes from states on a hypersurface in four-dimensional momentum space. With the 
"restriction to surface states" we assumed finally that the behavior on the light cone 
(4.3.24) is completely characterized by these states. 
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4.4. The Regularized Vacuum on the Light Cone, Vector Component 

We shall now extend the previous analysis to the vector component in (4.1.5). 
More precisely, we will analyze the Fourier integral (4.1.3) for 

P{p) = vj{p)^^f{p) (4.4.1) 

close to the light cone. We again choose light-cone coordinates (s, X2,X3) with y—x = 
(s,Z,0, 0) (s and I are given by (4.3.5), while X2 and xs are Cartesian coordinates in 
the orthogonal complement of the s/-plane). The associated momenta are denoted by 
p = {u,v,p2,P3) with u and v according to (4.3.6). As in (4.3.3), we integrate out the 
coordinates perpendicular to u and v, 

hj{u,v) := J dp2 J dps (vj f)iu,v,p2,P3) , (4.4.2) 

and obtain a representation of the fermionic projector involving two-dimensional Fourier 
integrals 

P{s,l) = 7'Pj{s,l) 

with 

/oo /"OO 
du dvhjiu,v)e-'^'''+''^K (4.4.3) 
oo J —oo 

The tensor indices in (4.4.2) and (4.4.3) refer to the coordinate system {s,l,X2,X3). 
For clarity, we denote the range of the indices by j = s, Z, 2, 3; thus 

7^ = 1(7°-^), y = ^(7° + y), (4.4.4) 

where 7*^, . . . ,7^ are the usual Dirac matrices of Minkowski space. According to the 
continuum kernel (4.1.1), P has in the case without regularization the form P = 
^ 5{p'^ — m?) @{—p^) and hj can be computed similar to (4.3.8) to be 

hj{u,v) = -^{-uY - vj^) Q{uv - m^) Q{u) . (4.4.5) 
327r'^ 

This limiting case specifies the regularized hj{u, v) for small energy-momentum u,v <^ 
Ep. In order to keep the form of the functions hj in the high-energy region sufficiently 
general, we merely assume in what follows that the hj satisfy all the conditions we 
considered for the function h in the previous section (see the summary in the last 
paragraph of §4.3). Our main result is the following. 

Expansion of the vector component: Close to the light cone (4-3.9, 4-3-10), the 
vector component (4.4. 1) of the fermionic projector of the vacuum has the expansion 
P = 'jiPj with 
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Ps{s,l) 



k=0 



k\ 



-u gs a'^ e du 



oo ^ 
n=l 

+ (rapid decay in /) + (higher orders in (amax/-Ep) 



(4.4.6) 



^ ^ fe=0 



(/^ - 1) a'' + k-a 
u 



k-l 



91 e 



du 



1 /-oo 



+ (rapid decay in I) + (higher orders in {ama.x/ Ep) 



(4.4.7) 



5^ 



du 



+ (rapid decay in I) + (higher orders in (amax/Ep) 



(4.4.8) 



and suitable regularization functions gj , g - , b, 62^5 and the mass regularization func- 
tion a as in (4-3.26, 4-3-29). In the low energy region u <C Ep, the regularization 
functions have the form 

^ Q{n), 5i"l(^) = (4.4.9) 



9s{u) 
9l{u) 



327r3 



(4.4.10) 
(4.4.11) 



^eiu), gt\u) = b{u) = 
92/3{u) = g'2fi{u) = b2^{u) = 0. 

Before entering the derivation, we briefly discuss these formulas. To this end, we 
consider the situation where, hke in the case without regularization, the vector v{p) in 
(4.4.1) points into the direction p. In this case we can write the vector component as 



P{p) = Pji' > 



(4.4.12) 



where {(pf) has the form of the scalar component considered in §4.3. Since multipli- 
cation in momentum space corresponds to differentiation in position space, we obtain 
for (4.4.3) 



Pis, I) 



. d , d 2d d \ 



P. 



scalar 



{s,l), 



where Pscaiar is the scalar component (4.3.7) with h as in (4.3.4). We now substitute 
for Pgcalar the expansion on the light cone (4.3.26-4.3.28) and carry out the partial 
derivatives. For the s- and i-components, this gives exactly the expansions (4.4.6, 
4.4.7) with 



9s 



91 



h, 



9i 



9l 



0. 



(4.4.13) 
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For the components j = 2,3, the calculation of the partial derivatives is not quite 
so straightforward because the expansion of the scalar component (4.3.26-4.3.28) was 
carried out for fixed X2 and 2:3. Nevertheless, one can deduce also the expansion (4.4.8) 
from (4.3.26-4.3.28) if one considers X2 and X3 as parameters of the regularization 
functions h, /i'"! and a, and differentiates through, keeping in mind that differentiation 
yields a factor l/l (to get the scaling dimensions right). In this way, the simple example 
(4.4.12) explains the general structure of the expansions (4.4.6-4.4.8). We point out 
that the regularization function b vanishes identically in (4.4.13). This means that b is 
non-zero only when the direction of the vector field v is modified by the regularization. 
Thinking in terms of the decomposition into the one-particle states, we refer to this 
regularization effect as the shear of the surface states. 

We shall now derive the expansions (4.4.6-4.4.8). Since the Fourier integrals in 
(4.4.3) are of the form (4.3.7), they have the expansion (4.3.24), valid close to the light 
cone (4.3.9, 4.3.10). It remains to determine the parameter uq in (4.3.24). We consider 
the components j = s,l,2 and 3 separately. According to (4.4.5), the function hg in 
the low-energy region looks similar to the hyperbola depicted in Figure 4.2. The main 
difference to the low-energy behavior of the scalar component (4.3.8) is the additional 
factor u in hs which grows linearly along branch "A" of the hyperbola. Thus in the 
low-energy region away from the origin, 

(/is)«(a«) ^ Ep and max \{hs)u{v)\ - Ep . (4.4.14) 

ve(s>,Ep) 

Hence it is natural to assume that hg satisfies the bound (4.3.23) with no = 0. Because 
of the linearly growing factor u in the low-energy region, it is convenient to write the 
regularization functions in the form 

{hsUau) =: -ugs{u), {hsi^\au) =: -u gf\u) (4.4.15) 

with suitable functions gs and g^f^ (this can be done because, as explained after (4.3.21), 
close to the origin w = 0, we can work with the unregularized fermionic projector). 
This yields the expansion (4.4.6). According to (4.4.5) and (4.4.15), the regularization 
functions have the low-energy limit (4.4.9). For the /-component, the situation is much 
different. According to (4.4.5), the function hi in the low-energy limit has the form 

hi{u,v) = -^ve{uv-m'^) . (4.4.16) 

The factor v decreases like m^/u along branch "A" of the hyperbola. Thus in the 
low-energy region away from the origin, 

{hi)u{au) ~ rn^/Ep whereas max \{hi)u{v)\ ~ Ep . (4.4.17) 

Therefore, we cannot assume that hi satisfies the bound (4.3.23) with uq = 0. But 

{hi)u\(Xu) ~ 1 in the low-energy region, and thus we may choose hq = 1. We conclude 
that it is necessary to take into account two inner summands in (4.3.24), more precisely 



1 °° i—il)^ f°° r 



k=0 

+ ■••, (4.4.18) 
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where "■ • • " stands for the regularization expansion and all terms neglected in (4.3.24). 
In the low-energy region, we have according to (4.4.16, 4.3.29), 

ihi)u{au) = = (hlYuioiu) 0!u . 

Thus in this region, the two summands in the square brackets of (4.4.18) are of the 
same order of magnitude, and none of them can be neglected. In view of the low-energy 
limit, we introduce the regularization functions as 

(hiYuiau) =■■ -9i{u) 
{hit^^\au) =: -{n + l)gY\u) 

{hi)'u{ccu)c^u-{hi)u{ccu) =•■ ^aiiu); (4.4.19) 

this yields the expansion (4.4.7). According to (4.4.5), the regularization functions 
have the low-energy limit (4.4.10). We finally consider the components j = 2 and 3. 
According to (4.4.5), these components are identically equal to zero in the low-energy 
limit. But for u ~ Ep, they might behave similar to Pg or Pi. To be on the safe side, 
we choose no = 1. Denoting the regularization functions by 

(/^2/3)^"iK) =: g^^iu) 

-{h20)u{ocu) =■■ ^-^^g2/3{u), (4.4.20) 

we obtain the expansion (4.4.8). According to (4.4.5), the regularization functions 
5^ and 62/? vanish in the low-energy region, (4.4.11). 

For clarity, wc point out that choosing hq = 1 (as in (4.4.7, 4.4.8)) is a generaliza- 
tion of setting no = (as in (4.4.6)), obtained by taking into account more summands 
of the expansion (4.3.21). Nevertheless, the different behavior in the low-energy region 
(4.4.14, 4.4.17) suggests that (4.4.7) and (4.4.8) should not be merely more general 
formulas than (4.4.6), but that the behavior of Pj{s,l), j = 1,2,3, should be really 
different from that of Ps{s,l). We shall now make this difference precise. Compar- 
ing (4.4.14) and (4.4.17) (and using that /i2/3 vanishes in the low-energy region), it is 
reasonable to impose that there should be a constant eghcar > with 

\{hj)u{au)\ < Eshear \ {hs)u{c^u)\ for u Ep and j = I, 2, or 3. (4.4.21) 

In view of (4.4.14) and (4.4.17), eghear should be as small as 

2 

TTl 

eshear ~ ■ (4.4.22) 

Ep 

However, if the surface states have shear (as defined earlier in this section), the constant 
ffshear must in general be chosen larger. In order to keep our analysis as general as 
possible, we will not specify here how £shear scales in the Planck energy, but merely 
assume that m? /E^, < eghcar < 1- Using (4.4.15), (4.4.19) and (4.4.20), the condition 
(4.4.21) can be expressed in terms of the regularization functions gj and bj as 

(b \ ^2/3 

-+au]gi, ~ 9-2fi < ^shear u gs for u ~ Ep. (4.4.23) 
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It is interesting to discuss what the condition (4.4.21) means for the functions Pj. 
We begin with the case without regularization. In this case, the vector component of 
P{x, y) points into the direction y — x, more precisely P{x, y) = i{y — x)j^^ S{x, y) with 
a scalar distribution S. In a composite expression like the closed chain y) P{y, x), 
one can contract the tensor indices and obtains in a formal calculation P{x, y)P{y, x) = 
{y — x)'^ S{x, y) S{y, x) with a scalar factor {y — x)'^ which vanishes on the light cone. 
Let us consider this contraction in our light-cone coordinates. Before the contraction, 
each factor {y — x)j^^ = 21 7* + 257' ~ 2/7* is, if we take only the leading contribution 
on the light cone (i.e. the lowest order in s/l), proportional to I. After the contraction, 
however, the product (y — x)"^ = Als is proportional to both I and s. Thus the contrac- 
tion yields, to leading order on the light cone, a dimensionless factor s/l. While the 
factor l"^ changes the scaling behavior in the "large" variable, the factor s tends to 
make the composite expression "small" near the light cone. The analysis of the scaling 
behavior in I can immediately be extended to the case with regularization by looking 
at the expansions (4.4.6) and (4.4.7). Let us consider as an example the leading term 
of the mass expansion. For the expansion (4.4.6), this is the summand k = 0, and 
it scales like Ps{s,l) l/l. If we assume that (4.4.21) holds with ffghcar according to 
(4.4.22), then (4.4.23) shows that b{u) ^ 1, and the summands in the square bracket in 
(4.4.7) are of comparable size. Hence the leading term of the expansion (4.4.7) is also 
the summand k = 0, and it scales in / like Pi(s,l) ~ 1//^. Hence the leading term of 
the sum Pi+^'^ Ps behaves like P ^ \/l + 0{\/l'^). Since s and / are null directions, a 
contraction of the tensor indices in the closed chain leads only to mixed products of the 
form Ps Pi., and this mixed product scales in I like PgPi ~ l/l^. Thus, exactly as in the 
case without regularization, the contraction of the tensor indices yields an additional 
factor If on the other hand, the condition (4.4.21) were violated, the regulariza- 
tion function h could be chosen arbitrarily large. But if h becomes large enough, the 
cominant contribution to (4.4.7) is the summand k = \ (notice that h does not appear 
in the summand k = 0), and hence P/(s, I) ~ l/l. This implies that Pg Pi ~ 1//^, and 
the contraction does no longer yield an additional factor l~^. This consideration is 
immediately extended to the components P2/3 by considering the Z-dependence of the 
summands in (4.4.8). We conclude that the condition (4.4.21) with ffghcar ^ 1 means 
that the contraction of the tensor indices yields a scalar factor which is small on the 
light cone. We refer to this condition by saying that the vector component is null 
on the light cone. If one wishes, one can simply take this condition as an additional 
assumption on the fermionic projector of the vacuum. However, the property of the 
vector component being null on the light cone also arises in the study of composite 
expressions in the fermionic projector as a compatibility condition and can thus be 
derived from the equations of discrete space-time (see Remark 6.2.4)). 

The next question is if our regularization functions a, gj, g^ and 6, which appear 
in our expansions (4.4.6-4.4.8), are all independent of each other, or whether there 
are some relations between them. Recall that the regularization functions are given in 
terms of the boundary values of the functions d^hj{u,v), n > 0, on the curve (u, a„) 
(see (4.4.15, 4.4.19, 4.4.20)). Since the (/ij)j=s,/,2,3 were treated in our two-dimensional 
Fourier analysis as four independent and (apart from our regularity assumptions) arbi- 
trary functions, we can certainly not get relations between the regularization functions 
by looking at the situation in the uv-plane. But we can hope that when we consider 
the surface states in four-dimensional momentum space (as introduced in §4.3), the 
geometry of the hypersurface defined by these states might yield useful restrictions for 
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the regularization functions. First of all, we mention that our discussion of surface 
states of the previous section applies without changes also to the vector component; 
wc will in what follows make use of the restriction to surface states. Since in the 
low-energy region the regularization is irrelevant and the results of §2.5 apply, we can 
furthermore restrict attention to large energy and momentum w, |A;| '-^^ Ep. We choose 
polar coordinates {uj,k = \k\,'d,ip) in momentum space and introduce the "mass shell 
coordinates" 

U = -\k\-u, V=\k\-co. (4.4.24) 

Notice that, in contrast to the coordinates u and v, (4.3.6), the variables (4.4.24) are 
the spherically symmetric part of a four-dimensional coordinate system {U,V,'d,(p). 
Extending also the notation (4.4.4) in a spherically symmetric way, we introduce the 
Dirac matrix 

7^ = 

Let us consider what the expansions (4.4.6-4.4.8) tell us about the surface states. 
Similar as explained before (4.3.30), the discontinuities of hj come about in (4.4.2) 
when the plane {u, v) = const meets and is tangential to the hypersurface of the surface 
states. We denote the tangential intersection point of the surface {u, v) = const with 
the hypersurface by Q = {U,V,'d,(p). In the high-energy region under consideration, 
the variable U is of the order Ep. The variable V, on the other hand, will be of 
order a{U) < amax- Thus our hypersurface is close to the mass cone in the sense 
that V/U ~ (Xmax/Ep <^ 1. As a consequence, the coordinate i? of the intersection 
point Q must be small (more precisely, ?? < y^amax/Ep), and we conclude that, to 
leading order in a^^x^/Ep, V = a{U). Hence we can write the hypersurface as a graph 
V = A{U, -d, If) with a function A satisfying the condition 

A{U, = 0) = a{U) + (higher orders in a^^y^/Ep) . 

One can think of the function A{u,d, ip) as the extension of a to the four-dimensional 
setting. In order to determine the structure of the Dirac matrices, we first recall that 
the assumption that the vector component is null on the light cone implied in our 
consideration after (4.4.6) that the parameter no corresponding to Pi, P2 and P3 was 
equal to one. This means that to leading order in cCmaxZ-Sp, only the function hs{u, v) 
is discontinuous on the curve (n, a„), and we conclude that the distribution P is on 
the hypersurface at the point Q a scalar multiple of 7*; we use the short notation 
P{Q) ~ 7*. Using again that is small, we obtain that to leading order in amax/Ep, 
P{U, A{U, 7? = 0), = 0) ~ 7"^. Since the spatial direction of the vector y — x in (4.1.3) 
can be chosen arbitrarily, we can by rotating our coordinate system immediately extend 
this result to general ■!? and 99, and obtain that P(U, a{U, t9, ip), i}, (p) ~ 7'^. Hence the 
surface states are described by a contribution to P of the form 

-327r3 g{U, i?, cp) 7^ 5{V - A{U, ^, 'p)) + (higher orders in a^^jEp) (4.4.25) 

with some function g. It is reasonable to assume that the functions in (4.4.25) are 
sufficiently regular. Similar to our regularity condition (4.3.12) for /i, we here assume 
that the derivatives of A and gs have the natural scaling behavior in Ep. More 
precisely, for all 721,722,723 > there should exist a constant c ^ lEp with 

\d^d';^d^A(U,d,p)\ + \ff^^d^^d;^g{U,dM\ < cEp^^ max(|A| + |5|) (4.4.26) 
for ah U ^ Ep. 
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The form of the surface states (4.4.25) allows us to calculate the regularization 
functions gj, and bj. For this, we first represent the matrix 7'^ in (4.4.25) in 

the Dirac basis (7-')j=s.L2,3; this yields the contributions of the surface states to the 
distributions (vj /). By substituting into (4.4.2) and carrying out the integrals over 
P2 and ps, one obtains the functions hj. Finally, the regularization functions can be 
computed via (4.4.15, 4.4.19, 4.4.20). This whole calculation is quite straightforward, 
and we only state the main results. To leading order in v/u, we can take A and g as 
constant functions, and thus the calculation of j'^hg + 7^/1/ reduces to the integral 

/ dp2 / dp3 (7" + - 7') g{u,-& = 0) S(v - a„ - ^ 

+(higher orders in v/u, amax/-£^p) • 

An evaluation in cylindrical coordinates yields that both gs{u) and gi{u) are equal to 
g{u, 1^ = 0), and we thus have the important relation 

gs{u) = gi{u) =: g{u) . (4.4.27) 

In the case without shear of the surface states, this relation was already found in 
(4.4.13); we now sec that it holds in a much more general setting. The calculation of 
the angular components j = 2, 3 gives for g<2fi contributions proportional to and 
u d2fi,9- Unfortunately, this is not very helpful because we have no information on the 

derivatives of A and g. The computation of the regularization functions g^^ involves 
higher derivatives of the functions in (4.4.25) and becomes quite complicated. We 
remark that the above analysis of the surface states can be carried out similarly for the 
scalar component of the previous section and gives relations between the regularization 
functions h and h^^\ (4.3.25), but these relations all depend on unknown details of the 
geometry of the hypersurface. We thus conclude that (4.4.27) is the only relation 
between the regularization functions which can be derived with our present knowledge 
on the surface states. 

We finally mention two assumptions on the regularization which, although we 
will not use them in the present work, might be worth considering later. The first 
assumption is related to the fact that P should as a projector be idempotent, = P. 
A formal calculation using (4.1.3) and (4.1.5) yields that 

{P^){x,y) = J -0^ P{p)^ e-'P^^-y^ with (4.4.28) 

Pipf = {2mvjip)l' + {vj{p)v^ip) + 4>{pf))fipf- (4.4.29) 

In order to make sense out of (4.4.29), one must regularize in momentum space, e.g. by 
considering the system in finite 3- volume and take a suitable limit. Since the results of 
this analysis depend sensitively on how the regularization in momentum space is carried 
out, (4.4.29) cannot give any detailed information on the functions 4>, v, or /. The 
only simple conclusion independent of the regularization is that the scalars multiplying 
the factors vj'j^ in (4.1.5) and (4.4.29) should have the same sign, and thus 4>{p) f{p) 
should be positive. According to (4.3.4), this implies that the regularization function 
h be positive, 

h{u) > for all u. 
This assumption is called the positivity of the scalar component. The second assump- 
tion is obtained by considering the rank of P(p). The 4x4 matrix (^ + m) in the 
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integrand of the unregularized fermionic projector (4.1.1) has the special property of 
being singular of rank two. This means that the fermionic projector is composed of 
only two occupied fermionic states, for every momentum p on the mass shell. The 
natural extension of this property to the case with regularization is that for every p 
on the hypersurface defined by the surface states, the matrix P{p) corresponding to 
the vector-scalar structure (4.1.5) should be of rank two. We refer to this property as 
the assumption of half occupied surface states. In terms of the functions /i(n, v) and 
hj(u,v), it means that hs(u,a{u)) hi(u,au) = h{u,au)^- Using (4.3.25, 4.4.15, 4.4.19, 
4.4.27), the assumption of half occupied surface states yields the following relation 
between the regularization functions of the scalar and vector components, 

{a{u) u + b{u)) g{uf = h{uf . (4.4.30) 

4.5. The General Formalism 

In this section we shall extend our previous analysis on the light cone in three 
ways: to the case with interaction, to systems of Dirac seas as introduced in §2.3 
and to composite expressions in the fermionic projector. Oiir first step is to develop a 
method which allows us to introduce a regularization into the formulas of the light-cone 
expansion (2.5.45). We here only motivate and describe this method, the rigorous justi- 
fication is given in Appendix D. Since the formulas of the light-cone expansion involve 
the factors T^, (2.5.43, 2.5.42, 2.5.40), we begin by bringing these distributions into a 
form similar to our expansion of the regularized scalar component (4.3.26). By partly 
carrying out the Fourier integral (2.5.40) in the light-cone coordinates introduced in 
§4.3 (see (4.3.5, 4.3.6)), we can write the distribution Ta as 

This formula can be regarded as a special case of the expansion (4.3.20) (notice that the 
function h{u,v) corresponding to Ta is computed similar to (4.3.8)), but (4.5.1) holds 
also away from the light cone. The distribution Ta is not diffcrentiable in a at a = 0, 
as one sees either directly in position space (2.5.41) or equivalently in (4.5.1), where 
formal differentiation leads to a singularity of the integrand at n = 0. We bypassed this 
problem by working instead of Ta with the distribution Ta^^ (2.5.42). Let us briefly 
consider what this "regularization" means in the integral representation (4.5.1). The 
formal a-derivative of (4.5.1), 



d _ 1 f°° I 



du 

lo 

is well-defined and finite for a 7^ because of the oscillatory factor exp(— mZ/u). 

However, the limit a -— > leads to a logarithmic divergence. Thus one must subtract 
a logarithmic counterterm before taking the limit; more precisely, 



1 



— e B('u) — (1 -I- logo) (5(u) 



du . 



The higher a-derivatives T^'"\ n > 1, are defined similarly using suitable counterterms 
which arc localized at u = 0. Since we do not need the details in what follows, we 
simply write 

1 /"OO / 1 \ reg 
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Consider a summand of the light-cone expansion (2.5.45), 

(phase-inserted Hne integrals) T^'^\s,l) . (4.5.3) 

According to our assumption of macroscopic potentials and wave functions described 
in §4.1, we shall regularize only the distribution T^^\ keeping the iterated line integral 
unchanged. Let us briefly analyze what this assumption means quantitatively. Not 
regularizing the iterated line integral in (4.5.3), denoted in what follows by F, will 
be a good approximation if and only if F is nearly constant on the Planck scale. In 
other words, not regularizing F is admissible if we keep in mind that this method can 
describe the regularized fermionic projector only modulo contributions of the order 
djF/Ep. In the case that this last derivative acts on the bosonic potentials and fields 
contained in we obtain the limitation already mentioned in §4.1 that energy and 
momentum of the bosonic fields should be small compared to the Planck energy. More 
precisely, we can describe the fermionic projector only to leading order in (Zmacro^p)"^) 
where Zmacro is a typical length scale of macroscopic physics. A point we did not pay 
attention to earlier is that the iterated line integrals also involve factors {y — x) which 
are contracted with the bosonic potentials and fields. Thus in light-cone coordinates, 
F will in general contain factors of Z. If the derivative in djF acts on a factor Z, this 
factor is annihilated. Hence keeping the iterated line integrals in (4.5.3) unchanged 
can describe only the leading order in (lEp)~^ of the fermionic projector. We conclude 
that the assumption of macroscopic potentials and wave functions is justified if and 
only if we restrict our analysis to the leading order in {lEp)~^ and {lmacroEp)~^ . We 
remark that going beyond the leading order in {lEp)~^ or (/macro-E'p)"^ would make 
it impossible to describe the interaction by classical fields and is thus at present out 
of reach. 

The restriction to the leading order in {lEp)~^ is a considerable simplification. 
First of all, we can neglect all regularization expansions (which are just expansions in 
powers of {lEp)~^; see e.g. (4.3.27) and the discussion thereafter), and thus we do not 

need to consider the regularization functions ^["l and gj"' . Next we compare for given 
k the summands in (4.4.6-4.4.8) (the analysis for fixed k is justified assuming that the 
vector component is null on the light cone; see (4.4.21) and the discussion thereafter). 
One sees that the tensor index j = s gives the leading contribution in {lEp)~^ to the 
vector component. This is a great simplification when tensor indices are contracted 
in composite expressions. Namely, when the vector component is contracted with the 
bosonic potentials or fields, it suffices to consider the contribution Pg, (4.4.6). If vector 
components are contracted with each other, the products of type P2/S are according 
to (4.4.6-4.4.8) of higher order in {lEp)~^ or Sghear than corresponding products of type 
Pg Pi- Hence in such contractions, we must take into account both Pg and Pi, but we 
can again neglect the components P2 and P3. We conclude that the only regularization 
functions which should be of relevance here are those appearing in (4.3.26) and in the 
mass expansions of (4.4.6) and (4.4.7), i.e. the four functions 

a{u) , g{u) , h{u) and h{u) (4.5.4) 

with g given by (4.4.27). 

Under the assumption of macroscopic potentials and wave functions, it suffices to 
regularize the factor T^"^ in (4.5.3). Our method for regularizing T^") is to go over 
to the integral representation (4.5.2) and to insert the regularization functions (4.5.4) 
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into the integrand. The procedure depends on whether the contribution to the Hght- 
cone expansion is of even or odd order in the mass parameter m. Furthermore, we 
must treat the factors [y — x)j^i^ in the hght-cone expansion separately. The precise 
rcgularization method is the following. 

Regularization of the light-cone expansion: A summand of the light-cone expan- 
sion (2.5.45) which is proportional to mP , 

vniP (phase-inserted line integrals) T^^\s,l) , (4.5.5) 

has the regularization 

(—1) (phase-inserted line integrals) (4.5.6) 

J_oo \«"y I g{u)a{u)2 for ^? even 

-I- (rapid decay in I) -\- (higher orders in {lEp)~^, (Zmacro-E'p)~S eghear) ■ (4.5.7) 

A contribution to the light-cone expansion (2.5.4-5) which is proportional to and 
contains a factor {y — x)j^^ , 

niF (phase-inserted line integrals) (y — x)j^^ T^'^\s, I) , (4.5.8) 
is properly regularized according to 
(—1) (phase-inserted line integrals) 



:{-ilY-^ j 



00 

du 

00 



1 \ reg / N reg / 1 ^ 



{p — 1 
hiu) a(u)^~ for p odd , / . -i . • 2 3\ 

^ ^ ^ E + (contributions ~ 7 , 7 ) 

g(u) a(u) 2 for p even 

+ (rapid decay in /) + (higher orders in (lEpy^, (/macro-E^p)~\ Sshear) • (4.5.9) 

In these formulas, the regularization function a is given by 

a{u) = ua{u) , (4.5.10) 

^shear is defined via (4.4-^Vj o,nd Imacro is a macroscopic length scale. 

Let us briefly explain and motivate this regularization method (see Appendix D 
for the derivation). First of all, we note that, after writing the factor {y — x)jj^ 
together with the iterated line integrals, the expression (4.5.8) is of the form (4.5.5), 
and the regularization rule (4.5.7) applies. Thus (4.5.9) is an extension of (4.5.7) 
giving additional information on the ^-component of the factor (y — x)j'y^ . As we 
shall see later, this information is essential when the factor {y — x)j in (4.5.8) is to 
be contracted with another factor (y — xy in a composite expression. To explain 
the formula (4.5.7), wc first point out that the expansions of the scalar and vector 
components (4.3.26-4.3.28, 4.4.6, 4.4.7) do not involve the mass parameter m. The 
reason is that m was absorbed into the regularization functions g, h and a, as one 
sees by considering the low-energy limit; see (4.3.29, 4.4.9, 4.4.10). Furthermore, we 
note that each contribution to the mass expansions of the scalar or vector components 
contains either a factor h or g (see (4.3.26, 4.4.6, 4.4.7)), and it is therefore reasonable 
that we should also use exactly one of these factors here. As a consequence, the power 

in (4.5.5) uniquely determines how many factors of each regularization function we 
should take. Namely for even p, we must take one factor g and p/2 factors a, whereas 
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the case of odd p gives rise to one factor h and {p — l)/2 factors a. On the other 
hand, we know that the insertion of the regularization functions into (4.5.2) should 

modify the behavior of the integrand only for large u ~ Ep; in particular, wc should 
for small u have a behavior ~ u~'"'. In order to comply with all these conditions, one 
must insert the regularization functions precisely as in (4.5.7). In order to motivate 
(4.5.9), we consider the expansion of the vector component (4.4.6, 4.4.7). Recall that 
the regularization function b vanishes in the low-energy region (4.4.10) and describes 
the shear of the surface states (as explained after (4.4.13)). Since this effect is not 
related to the mass of the Dirac particle, it is plausible that we should not associate 
to 6 a power of m. For the mass expansion of the vector component, we should thus 
collect all terms to a given power of a. The contribution ~ to j^Ps + ^''Pi takes 
according to (4.4.6, 4.4.7) the form 

1 {-ilf p / . k-l I 6 A ^ 
il k\ 7_oo V " / 

In order to obtain the correct behavior in the low-energy region, wc must multiply this 
formula by —21 and choose k = n + \. This explains the form of the square bracket in 
(4.5.9). The combination of the regularization functions g, h and a in (4.5.9) can be 
understood exactly as in (4.5.7) using power counting in m. 

Our constructions so far were carried out for the case iV = 1 of one Dirac sea. We 
will now generalize our regularization method to systems of Dirac introduced 
in (2.3) and will also introduce a compact notation for the regularization. Exactly 
as in §2.5 we only consider the auxiliary fermionic projector, because the fermionic 
projector is then obtained simply by taking the partial trace (2.3.20). We first outline 
how chiral particles (e.g. neutrinos) can be described. Without regularization, a chiral 
Dirac sea is obtained by multiplying the Dirac sea of massless particles with the chiral 
projectors Xl/r = ^ i^Tp)', for example in the vacuum and left/right handed particles, 

P{p) = XL/R^5(/) e(V) • (4.5.11) 

The most obvious regularization method is to deduce the regularized chiral Dirac sea 

from a Dirac sea regularized with our above methods again by multiplying from the 
left with a chiral projector. This simple method indeed works, under the following 
assumptions. First, we must ensure that the regularized fermionic projector of the 
vacuum is a Hcrmitian operator. To this end, wc must assume that the scalar com- 
ponent (f) in (4.1.5) be identically equal to zero (this generalizes the requirement of 
massless particles needed in the case without regularization). Hence we regularize 
(4.5.11) by setting 

Pip) = XL/RVj{pW f{p) ■ 
The expansions near the light cone arc then obtained from (4.3.27, 4.3.28) and (4.4.6- 
4.4.8) by setting the scalar regularization functions h and /i^"] to zero and by mul- 
tiplying with Xl/r- Assuming furthermore that the bosonic potentials are causality 
compatible (see Def. 2.3.2), the formulas of the light-cone expansion are regularized 
likewise by taking the regularizations (4.5.7, 4.5.9) with h set identically equal to zero, 
and by multiplying from the left by a chiral projector Xl/r- 

We next consider the generalization to systems of Dirac seas. In the vacuum, 
we can describe a system of Dirac seas by taking a direct sum of regularized Dirac 
seas and by using instead of the chiral projectors Xl/r the chiral asymmetry matrix 
X (see (2.3.7)). Since we may choose the regularization functions for each Dirac 
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sea independently, this procedure clearly increases the total number of regularization 
functions. However, it is natural to impose that the regularization should respect all 

symmetries among the Dirac seas. More precisely, if the fermionic projector of the 
vacuum contains identical Dirac seas (e.g. corresponding to an underlying color SU (3) 
symmetry), then we will always use the same regularization functions for all of these 
Dirac seas. Once the regularization has been specified for the vacuum, we can again 
apply the rules (4.5.5 4.5.9) to regularize the light-cone expansion. In the special case 
that the bosonic potentials are diagonal in the Dirac sea index, we can simply take the 
direct sum of the contributions (4.5.7, 4.5.9), using in each summand the regularization 
functions of the corresponding vacuum Dirac sea. In the general case of a non-diagonal 
bosonic field, the regularization functions can be inserted uniquely if one uses that, 
according to the assumption of macroscopic potentials and wave functions of §4.1, the 
fermionic projector is modified by the bosonic fields only on the macroscopic scale, 
so that its microscopic structure is the same as in the vacuum. For example, one 
can in the case of a gravitational and Yang-Mills field make the bosonic potential 
locally to zero by transforming to a suitable coordinate sytem and gauge, can in this 
system insert the regularization functions as in the vacuum and can finally transform 
back to the original system. We conclude that the generalization of our regularization 
method to systems of Dirac seas is quite straightforward and canonical. Therefore we 
can introduce a short notation for the rcgularizations of the factors T^") in the light- 
cone expansion by simply adding a label for the order in the mass parameter. More 
precisely, we introduce in the case A'^ = 1 of one Dirac sea the following abbreviations 
for the Fourier integrals in (4.5.7) and (4.5.9), 



J —oo 



-ius 



J h{u) a{u) 2 for p odd 
g{u) a{u)^ for p even 



(^T^")) ^ -(-zO"-i Tdne--^ X ( for odd 

j-oo I giu) a{u) 2 for p even 

r / -I \ ree / \ ree ^ 1 ^ ^'^^ 

,n+2 



1 \ reg 



1 \''*'^ 
— + 2Z Ku) 7' 



u 



. . /-oo / 1 \ reg 



b(u) 



p-1 

h{u) a{u) 2 for p odd 
g{u) a{u)2 for p even 



(4.5.12) 



(4.5.13) 



(4.5.14) 



In the case of a system of Dirac seas (2.3.3), we use the same notation for the corre- 
sponding direct sum. With this notation, the regularization of the light-cone expansion 
is carried out (modulo all the contributions neglected in (4.5.7) and (4.5.9)) merely by 
the replacement mP T^"'\x,y) Tj^"-* and by marking with brackets that the factors 

(y — x)j'y^ and Tj^"^ belong together (where we use the abbreviation ^ = y — x) . We 

call a factor ^ inside the brackets (^T/?-*) an inner factor ^ . Notice that the functions 
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T^py in (4.5.14) involve the regularization function b; they wih be needed below to 
handle contractions between the inner factors. 

We finally come to the analysis of composite expressions in the fermionic projec- 
tor. In §4.2 we already discussed the simplest composite expression, the closed chain 
P{x,y) P{y,x) in the vacuum. In order to analyze the closed chain near the light 
cone, we substitute for P{x, y) and P{y, x) the regularized formulas of the light-cone 
expansion and multiply out. It is convenient to use that the fermionic projector is 
Hermitian and thus P{y,x) = P{x,y)*; hence the light-cone expansion of P(y,x) is 
obtained from that for P{x, y) by taking the adjoint (with respect to the spin scalar 
product). The iterated line integrals can be multiplied with each other giving smooth 
functions; also we can simplify the resulting product of Dirac matrices using their 

anti-commutation relations. Denoting the adjoints of (4.5.12) and (4.5.13) by 

and {^T^^), respectively, we thus obtain for the closed chain a sum of terms of the 
following forms. 



prp(ni) rp{n2) p rpini)-. rp{n2) 

[ri\_\r2\_ ' ^'^^'l [ri] > N] (4 5 15) 

ft("|) (e;,T("^^) , F(e„r,^"?M62r,("i^) , 

where is a smooth function in x and y and n^, rj are integer parameters. Here 
the tensor indices of the inner factors £^ are contracted cither with each other or with 
tensor indices in the smooth prefactor F. In order to analyze Euler-Lagrange equa- 
tions like for example (3.5.20, 3.5.21), we need to consider more general expressions. 
More precisely, all Euler-Lagrange equations in this book can be written in terms of 
expressions being a product of a smooth function with a quotient of two monomials 

(n) (n) 

in Tj^j and Tj^j , possibly with inner factors ^ in the numerator. Thus our key problem 
is to mathematically handle expressions of the form 



(smooth function) x 



[Sl] [Sf] [Sf+l] ' ' ' [Sg] 



xi^hThh ■ ■ ■ (6 T^^) T^"""^'? ■ ■ ■ 



X (^7.^1 T'f'^^^^) ■ ■ ■ (e,- T,("r^) T^""'^'? ■ ■ ■ (4.5.16) 

with 0</<5, 0<a<6<c<d, parameters lj,Si,ni,pi and tensor indices jj. 
Here the tensor indices of the inner factors are again contracted either with other 
inner factors or with tensor indices in the smooth prefactor. We mention for clarity 
that, since the factors in (4.5.16) are complex functions or, in the case > 1 of 
systems of Dirac seas, direct sums of complex functions, the product (4.5.16) clearly 
is commutative. 

The inner factors in (4.5.16) can be simplified using the particular form (4.5.12, 
4.5.13) of rj[.j^ and (^jT^^l^). We begin with the case of an inner factor which is 
contracted with a tensor index in the smooth prefactor, i.e. with products of the form 



■■■F^^.tI;^)... or ...FMG^'f)--- 

and a smooth vector field F, where "• • • " stands for any other factors of the form as 
in (4.5.16). According to (4.5.13), to leading order in {lEp)~^ it suffices to take into 
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account the s-componcnt, and thus (4.5.12) yields that {^T|^^^) « 21 ^ Since 
21Y coincides on the Ught cone with ^, we conclude that, to leading order in {lEp)~^, 

(^.-T^y ) = F^'C, r(;;) and {J~T^) = F^^j¥^ . (4.5.17) 

These relations coincide with what one would have expected naively. Wc next consider 
the case of two inner factors which are contracted with each other, i.e. products of the 
following form. 



(n2)N 



or 



-,(ni) 



••■(e.-rKO(e^-r£f)--- 



(4.5.18) 



In this case, the product cannot be calculated naively because the factor ^j^-' = 
vanishes on the light cone. But we can still compute the product using the Fourier 
representation (4.5.13). Since the s- and /-directions arc null, only the mixed products 
of the s- and /-components in (4.5.13) contribute, and we obtain 



x(-zz)"2-iy" 



u 



duo 



1 



dui ^ e-*"i^ X 



'1 " 

h{ui) 0,(77,1 )~ 
g{ui) a{ui)~ 



2m2 ^ 21 6(7/2) 



n2+l 



n2+2 



-IU2S 



r2 — 1 

h{u2) a{u2)^~ 



-c 



dui 



2mi 



+ 



y g{u2) 0(7/2) 
21 b{ui 



u 



lUlS 



-( 



du2 — e-'"2" X 



u. 



"2 



ni+2 
1 

rx-l 

h{ui) a(77i)~2~ 
h{u2) 0(77,2 

7 

5(7/2) 0(7/2)" 



for ri odd 
for ri even 



for r2 odd 
for r2 even 



for ri odd 
for ri even 

'•2-1 



for r2 odd 
for r-2 even , 



and similarly for the two other products in (4.5.18). In the case of systems of Dirac 
seas, this calculation can be done for each summand of the direct sum separately. 
Rewriting the Fourier integrals using the notation (4.5.12) and (4.5.14), we get the 
following result. 

Contraction rules: To leading order in {lEp)~^, 

(e.-r[t:;^)(^^T£f) 



-2 t("|) (r/2 r("^+'^ + Ti"\+')) - 2 (m r^"^+') + Ti"\+'h T^"^^ (4.5.19) 
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-2 r("|) (n2 r,^"?+'^ + rj"\+'h - 2 (m r,^"|+'^ + ri"\+'h r,^"^) (4.5.20) 

[ri] ^ ^ [r2] {r2} ' ^ [ri] {n} ' [r2] ^ ' 



V2\ 



= -2 r^"|) (n2 r("i'+'^ + TiX^^) - 2 (m r("[+') + r("\+')) rf^'f . (4.5.21) 

[ri] ^ ^ \ri\ {ri} ' \ ^ [n] {n} / [r2l 

By iteratively applying (4.5.17) and the contraction rules (4.5.19-4.5.21), we can in 
(4.5.16) eliminate all inner factors ^ to end up with products of the form 



(smooth function) — ° ° (4.5.22) 

-to ■■■-^o -to ■■■-'o 

with parameters a, (5 > 1, 7,(5 > and ai,bi,Ci,di (if 7 = = S the denominator 
clearly is equal to one). Here each subscript "o" stands for an index [r] or {r}. The 
quotient of the two monomials in (4.5.22) is called a simple fraction. 

We point out that the above transformation rules for the inner factors (4.5.17) and 
(4.5.19-4.5.21) are identities valid pointwise (i.e. for fixed x and y) close to the light 
cone. We anticipate that Euler-Lagrange equations like (3.5.20, 3.5.21) do not lead us 
to evaluate the products of the form (4.5.16) pointwise, but merely in the weak sense. 
Therefore, we now go over to a weak analysis of the simple fraction. In the case of a 
continuous regularization, we thus consider the integral 



'x r){x) ° " ° _ " ° (4.5.23) 

rp{ci) rp{c-,) rp(dl) rp(ds) 

-to ***-to -to ***-to 



with a test function 77. Before coming to the derivation of calculation rules for the 
integrand in (4.5.23), we must think about how the test function rj is to be chosen. As 
explained in §4.2 in the example of the closed chain (4.2.2), a weak integral in general 
depends essentially on the unknown high-energy behavior of the fermionic projector 
and is therefore undetermined. To avoid this problem, we must evaluate (4.5.23) in 
such a way that our expansions near the light cone become applicable. To this end, we 
assume that rj has its support near the light cone, meaning that in light-cone coordinates 
{s,l, X2, x^), the "large" variable I satisfies on the support of rj the conditions (4.3.10). 
For clarity, we remark that this definition does not state that the support of 77 should 
be in a small neighborhood of the light cone, but merely in a strip away from the 
origin. This is sufficient because we shall extract information on the behavior near the 
light cone by considering the singularities of the integral for Ep — > 00 (see (4.5.29) 
below). Furthermore, we assume that ry is macroscopic in the sense that its partial 
derivatives scale in powers of l~^ or /macro- Under these assumptions, the integrand 
in (4.5.23) is macroscopic in I, and carrying out the s- and /-integrals in (4.5.23) gives 
a function which is macroscopic in the "transversal" variables X2 and 0:3. Therefore, 
in the three variables {l,X2,X3), a weak analysis is equivalent to a pointwise analysis, 
and thus it suffices to consider the s-integral in (4.5.23), i.e. the expression 



J — ( 



rp{ai) rp{aa) rp{hl) rjiibfj) 

1 -to ***-to -to ***-to / A f r\ 4\ 

ds r] = (4.5.24) 

, T^(ci) 7-1(27) rpidl) rpidg) 

-to '''-to -to '''-to 
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for fixed I, X2 and X3. In the case of a discrete regularization, the integral in (4.5.23) 
must be replaced by a sum over all space-time points, i.e. we must consider instead of 
(4.5.23) the weak sum 



E 



xeM ri^i)...ri^7)2;(<ii)...j.K) 

where M C M"^ are the discrete space-time points and 77 is a macroscopic function in 
with support near the light cone. Up to a normalization factor, (4.5.25) can be regarded 
as a Riemann sum which approximates the integral (4.5.23). Assuming that the space- 
time points have a generic position in M'* and keeping in mind that the function inside 
the sum (4.5.25) is macroscopic in the variables I, X2, and X3, the Riemann sum and 
the integral indeed coincide to leading order in {lEp) ^ and (Zmacro 

Ep)~'^. Hence it is 

admissible to work also in the discrete case with the one-dimensional integral (4.5.24). 
Let us analyze the integral (4.5.24) in more detail. We first consider how (4.5.24) 

in) 

scales in the Planck energy. In the limit Ep ^ 00, the factors To go over to distribu- 
tions which arc in general singular on the light cone. Hence their product in (4.5.24) 
becomes ill-defined for Ep 00 even in the distributional sense, and thus we expect 
that the integral (4.5.24) should diverge for Ep 00. The order of this divergence can 
be determined using the following power counting argument. Keeping in mind that 
the regularization functions decay on the Planck scale u ~ Ep, the Fourier integrals 
(4.5.12) and (4.5.14) behave on the light cone (i.e. for s = 0) like 

ri") ~ loga {Ep)E-''+^ 

with g = 1 in the case n = 1 and g = otherwise. Hence the product in the integrand 
of (4.5.24) scales on the light cone as 



o ' ' ' o o ' ' ' o 



-1-0 ' ' ' -i-o -to ■■■-to 



log^ {Ep)E^ (4.5.26) 



with g E Z and 



a ,fl 7 5 

L = a + P-j-5-J2aj-J2bj+J2''j+J2'^j- (^■^■27) 
j=i j=i j=i j=i 

We call L the degree of the simple fraction. We will here restrict attention to the 
case L > 1. In this case, the simple fraction (4.5.26) diverges in the limit Ep 00 
at least quadratically. If s is not zero, the oscillations of the factor exp(—ius) in 

(4.5.12, 4.5.14) lead to a decay of ri"-* on the scale s ^ Ep^. This consideration shows 
that the dominant contribution to the integral (4.5.24) when Ep ^ 00 is obtained by 
evaluating ij on the light cone, and the scaling behavior of this contribution is computed 
by multiplying (4.5.26) with a factor Ep^ . We conclude that (4.5.24) diverges in the 
limit Ep — 00, and its leading divergence scales in Ep like 

r]{s = 0) logS (Ep) E^-^ . (4.5.28) 

Collecting the logarithmic terms in the light-cone expansion, one can easily compute 
the parameter g. We remark that due to possible zeros in the denominator, the 
integral (4.5.24) might diverge even for finite Ep. In this case we can still use (4.5.28) 
if we set the proportionality factor equal to plus or minus infinity. We also note that, by 
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substituting the Fourier representations (4.5.12, 4.5.14) into (4.5.24), one can rewrite 
the products in (4.5.24) in terms of the regularization functions (this is explained in 
detail in Appendix E for a particular choice of regularization functions). Collecting 
the factors of / in (4.5.12) and (4.5.14), wc end up with the following result. 

Weak evaluation near the light cone: Consider the integral (4-5.24) for a simple 
fraction of degree L > 1. Then there is an integer g > and a real coefficient Creg 
independent of s and I such that for every macroscopic test function rj, 



f 



rp{ai) rp{aa) rp{bl) rp{bp) 

ds rj ° " ° 11 " ° = ^n{s = 0) log^iEp) E^-^ 

(ci) (c^) (di) (ds) {ll)^ / a \ J P 

J-o ' ' ' J-o J-o ' ' ' J-o 



(higher orders in (lEp) ^ and (/macro Ep)-^) . (4.5.29) 



The coefficient Creg clearly depends on the indices of the simple fraction and on the 
details of the regularization. We call Creg a regularization parameter. 

Integrals of type (4.5.24) can be transformed using integration by parts. For clarity 
we begin with the special case of a monomial, 

r {ts") = ~r "^'"^Ts (^°"'^ ■ ■ ■ ^) ^"^-^-^^^ 



roo r / 

/ ds r] I 

J —oo L \ 



.ds ° J ° 

° ° ds ° 



(4.5.31) 



where in the last step we applied the Leibniz rule. Differentiating (4.5.12) and (4.5.14) 
with respect to s yields that 

d^n) ^ _lTin-l) dj^) ^ Tin-l) 

as ds 
With these relations, we can carry out the derivatives in (4.5.31). Notice that the 
differentiation rules (4.5.32) decrease the index n by one. According to (4.5.27) and 

(4.5.29), decrementing the upper index of a factor To "' ■^ or To ''^ increments the degree 
of the monomial and yields in the weak integral a factor of the order Ep/l. Using 
furthermore that r] is macroscopic (as defined after (4.5.23)), we conclude that each 
summand in (4.5.31) dominates the left side of (4.5.30) by one order in lEp or ImacioEp. 

We have thus derived the following result. 

Integration-by-parts rule for monomials: Consider a monomial of degree L > 
1. In a weak analysis near the light cone, we have to leading order in {lEp)~^ and 

i^macroEp) , 



= . . . 2^("p) j^^bi) _ _ _ J,{bq) _|_ _ _ _ _|_ ji(ai) . . . rp{ap-l) rp{bl) . . . jt( 

The integration-by-parts method works similarly for simple fractions. For ease in 
notation we state it more symbolically. 

Integration-by-parts rule for simple fractions: Consider a simple fraction of 
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degree L > 1. In a weak analysis near the light cone and to leading order in {lEp) ^ 

and {lmacroEp)~^ , 



rp(ai) rji{aa) rp(bl) rpib/}) 

V I ° " ° _ ° " ° 1=0. (4.5.34) 

rp{ci) _ _ _ rpic^) rp{dl) _ _ _ rp{ds) 

Here V acts on all factors like a derivation (i.e. it is linear and satisfies the Leibniz 
rule), commutes with complex conjugations and 

1 rp(,n-l) 

vri"^ = ri"-') , v-^ - -— — 



The integration-by-parts rule gives us relations between simple fractions. We say that 
simple fractions are independent if the integration-by-parts rules gives no relations be- 
tween them. More systematically, we consider the vector space of linear combinations 
of simple fractions. We say that two such linear combinations arc equivalent if they 
can be transformed into each other with the integration-by-parts rules. We refer to 
the equivalence classes as the basic fractions. Taking the linear combination of the 
corresponding regularization parameters Creg, we can associate to every basic fraction 
a so-called basic regularization parameter. In Appendix E it is shown for all simple 
fractions which will appear in this book that the corresponding basic fractions are 
linearly independent in the sense that there are no further identities between them. 
Therefore it seems a reasonable method to take the basic regularization parameters as 
empirical parameters modeling the unknown microscopic structure of space-time. 

We remark that the notion of the basic fraction can be made more concrete by 
choosing from each equivalence class one representative. Then one can identify ev- 
ery basic fraction with the distinguished simple fraction in its equivalence class. For 
simplicity we give this construction in the special case that the simple fractions are 
monomials of the form 

rp{ai) rp{ap) rp{bl) rp{bq) 

J-o ' ' ' -'-o -to ■■■-'o 

(the construction can immediately be extended to simple fractions, but it becomes a 
bit complicated and we do not need it here). If only one factor Tj"^ appears {p = 1), 
one can by applying the integration-by-parts rule iteratively increment the parameter 
ai; this clearly decreases the other parameters bi,. . . ,bq. In order to avoid that any of 
the parameters bi,. . . ,bq becomes smaller than —1, we stop the integration-by-parts 
procedure as soon as one of the bj equals —1. In this way, we can express every 
monomial as a unique linear combination of monomials of the form 



jnia,) _ _ _ j.{bq) ^.^^ -l = bi<-- - <bq. (4.5.35) 

Similarly for p > 1, the integration-by-parts rule allows us to increment the smallest 
of the parameters aj, unless either one of the parameters bj equals —1 or there are 
two factors T^"-') with aj = min(ai, . . . ,ap). By iteration, we can thus transform any 
monomial into a linear combination of monomials of the following type. 



^-^^ ai<---<ap, bl<---<bq 
and ai = 02 or 61 = — 1 . (4.5.36) 
We can now consider (4.5.35) and (4.5.36) as the basic monomials. 
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With the above constructions we have developed the mathematical framework for 
analyzing composite expressions in the fermionic projector in the continuum. Our 

procedure is outlined as follows. We first substitute for the fermionic projector the 
regularized formulas of the light-cone expansion; this yields sums of products of the 
form (4.5.16), where the smooth prefactor involves the bosonic potentials and fields 
as well as the wave functions of the Dirac particles and anti-particles of the system. 
Applying our contraction rules, we then eliminate all inner factors and obtain terms of 
the form (4.5.22). When evaluated in the weak sense (4.5.29), the /-dependence deter- 
mines the degree L of the simple fraction, and the dependence on the regularization is 
described for each simple fraction by the corresponding regularization parameters Crcg- 
Using our integration-by-parts rule, we can furthermore restrict attention to the ba- 
sic fractions and the corresponding basic regularization parameters. Taking the basic 
regularization parameters as free empirical parameters, the composite expressions in 
the fermionic projector reduce to expressions in the bosonic fields and fermionic wave 
functions, involving a small number of free parameters. This procedure for analyzing 
composite expressions in the fermionic projector is called the continuum limit. 
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